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ABSTRACT  (Continued) 

ith  an  attempt  to  quantify  subjective  responses  to  sounds,  especially 
in  the  discrimination  between  similar  sounding  noise-like  stimuli.  A 
model  of  the  discrimination  process  as  one  of  feature  extraction  and 

rtf is j nn m king  about  the  presence  of  salient  features  is  proposed.  ^ 

The  model  and  tHe  signal  patterns -used-  in  the  experiments  are~restncted 
to  features  of  the  sounds  which  are  dichotomous,  i.e.,  present  in  one 
sound  pattern  and  absent  in  the  other  of  a pair  of  sounds  to  discrimin- 
ate. 

^Experiments  using  both  marine  sounds  and  laboratory  generated  signals 
were  conducted  using  trained  University  student  listeners  and  Navy 
personnel.  Two  experimental  methods  were  applied  with  one  using  a 
modification  of  classical  threshold  techniques.  The  other  procedure 
used  a continuous  rating  scale  to  elicit  information  about  the  confi- 
dence of  the  listener's  discrimination  decision. 
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The  major  portion  of  the  data  collected  was  for 'The  case  where  the 
dichotomous  feature  was  a band  of  noise.  This  features  results  either 
from  filtering  a marine  sound  or  from  deleting  a bano^limited  component 
from  laboratory  generated  sound  patterns.  The  data  waV  analyzed  from 
the  point  of  view  of  hypothesis  tests  on  the  presence  of  the  feature. 
Data  available  from  signal  detectability  experiments  was  used  to 
predict  the  required  signal-to-noise  ratio  to  make  a discrimination 
decision.  Good  agreement  was  found  between  this  prediction  and 
experiment  results. 

The  second  dichotomous  feature  was  that  associated  with  quasi-periodicj 
variations  in  Ihe  amplitude  of  the  sound.  The  results  for  this  feature 
also  show  good  agreement  with  predictions.  The  data  for  two  marine 
sounds  and  a third  laboratory  generated  one  exhibit  consistent  listener 
performance . 


Results  reported  in  the  thesis  cover  a significant  number  of 
discrimination  tasks  using  complex  sounds.  The  findings  support  the 
model  of  the  process  as  one  of  feature  extraction  and  subsequent 
decision  making  based  on  the  detectability  of  the  features.  A number 
of  areas  requiring  further  study  were  identified,  and  although  some 
anomolies  were  noted,  the  work  provides  the  basis  for  future  detailed 
studies  to  further  quantify  the  processes  involved  in  the  recognition 
of  noise-like  sounds  by  humans. 
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Whereas  the  factors  important  in  the  understanding  of  speech 
waveforms  in  the  presence  of  extraneous  noise  or  when  distorted  are 
reasonably  well  understood,  factors  important  in  recognizing  noise- 
like sounds  are  not.  Many  of  the  noise-like  sounds  to  which  man  is 
exposed  serve  to  convey  useful  information  and,  under  proper  circum- 
stances, allow  the  listener  to  make  inferences  about  the  source  The 
ability  of  the  listener  to  react  to  differences  between  such  sounds 
is  significant  in  a number  of  industrial  situations. 

This  thesis  deals  with  an  attempt  to  quantify  subjective 
responses  to  sounds,  especially  in  the  discrimination  between  similar 
sounding  n:ise-like  stimuli.  A model  of  the  discrimination  process 
as  one  of  feature  extraction  and  decision  making  about  the  presence 
ct  salient  features  is  proposed.  The  model  and  the  signal  patterns 
used  in  the  experiments  are  restricted  to  features  of  the  sounds  which 
are  dichotcmous,  l.e  , present  in  one  sound  pattern  and  absent  in  the 
other  ot  a pair  cf  sounds  to  discriminate. 

Experiments  using  both  marine  sounds  and  laboratory  generated 
signals  were  conducted  using  trained  University  student  listeners  and 
Navy  personnel.  Two  experimental  methods  were  applied  with  one  using 
a modification  of  classical  threshold  techniques.  The  other  procedure 
used  a continuous  rating  scale  to  elicit  intormatlon  about  the 
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confidence  ot  the  listener’s  discrimination  decision. 
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The  major  portion  of  the  data  collected  was  for  the  case  where 
the  dichotomous  feature  was  a band  of  noise.  This  feature  results 
either  from  filtering  a marine  sound  or  from  deleting  a band  limited 
component  from  laboratory-generated  sound  patterns.  The  data  was 
analyzed  from  the  point  of  view  of  hypothesis  tests  on  the  presence 
of  the  feature.  Data  available  from  signal  detectability  experiments 
were  used  to  predict  the  required  signal-to-noise  ratio  to  make  a 
discrimination  decision.  Good  agreement  was  found  between  this 
prediction  and  experiment  results. 

The  second  dichotomous  feature  was  that  associated  with  quasi- 
periodic  variations  in  the  amplitude  of  the  sound.  The  results  for 
this  feature  also  show  good  agreement  with  predictions.  The  data  for 
two  marine  sounds  and  a third  laboratory-generated  one  exhibit 
consistent  listener  performance. 

Results  reported  in  the  thesis  cover  a significant  number  of 
discrimination  tasks  using  complex  sounds.  The  findings  support  the 
model  of  the  process  as  one  of  feature  extraction  and  subsequent 
decision  making  based  on  the  detectability  of  the  features.  A number 
of  areas  requiring  further  study  were  identified,  and  although  some 
anomalies  were  noted,  the  work  provides  the  basis  for  future  detailed 
studies  to  further  quantify  the  processes  involved  in  the  recognition 
of  noise-like  sounds  by  humans. 


CHAPTER  I 


INTRODUCTION 


1 1 Statement  of  the  Pioblem 

Many  cf  the  noise-like  sounds  to  which  man  is  exposed  serve 
also  to  convey  useful  information.  Under  proper  conditions,  such 
ncises  allow  the  listener  to  identify  the  source  and  to  make 
inferences  about  it.  The  ability  to  react  to  differences  between 
such  sounds  is  of  considerable  significance  in  a number  of 
situations  ranging  from  the  esrimation  of  a vehicle's  speed  to  the 
adjustment  of  the  cutting  rate  of  a lathe.  This  thesis  deals  with 
an  attempt  to  quantify  subjective  responses  to  sounds,  especially 
in  the  discrimination  between  similar  sounding  noise-like  stimuli. 
Relevant  knowledge  is  reviewed  and  signal  detectability  is  used 
as  the  basis  for  a model  of  the  discrimination  process.  An 
experimental  method  is  described  and  the  results  of  studies  using 
University  students  and  Navy  sonar  operators  are  presented. 

1 2 Importance  of  the  Problem 

The  ability  cf  a listener  to  recognize  noise-like  sounds,  to 
respond  tc  changes  of  such  sounds,  and  to  discriminate  between 
complex  sounds  plays  a role  in  man's  everyday  life  (Sheridan  and 
Ferrell,  1974i.  in  industrial  environments,  this  capability  is 
viral  to  the  saiety  and  often  the  productivity  of  the  worker 
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(Murrell,  1965).  Efforts  to  protect  workers  from  excessive  industrial 
noise  have  resulted  in  mandated  use  of  hearing  protectors  in  many 
industrial  situations  (EPA,  1972).  The  effect  of  these  protectors 
on  the  ability  of  the  worker  to  respond  to  industrial  noises  can  be 
determined  if  a thorough  knowledge  of  the  auditory  process  is  available. 
In  addition,  future  monitoring  of  industrial  processes  may  rely  on 
automated  systems  for  sonic  processing  to  remove  humans  from  areas 
with  excessive  noise  levels.  Such  automatic  recognition  and 
discrimination  devices  may  well  use  concepts  derived  from  speech 
recognition  and  understanding  systems  (Lesser  et  al . , 1975; 

Myasnikov  and  Myasnikova,  1970).  In  order  to  make  this  transfer, 
a quantitative  knowledge  of  the  factors  which  allow  humans  to 
distinguish  between  noise-like  sounds  will  be  needed. 

A large  body  of  literature  is  concerned  with  the  processes 
involved  in  speech  recognition  and  discrimination  (Licklider  et  al., 
1948;  Pollack,  1948;  Stevens  and  House,  1972).  However,  the  important 
aspects  of  noise-like  sound  recognition  have  not  been  investigated 
systematically. 

1 . 3 Approach 

This  thesis  describes  an  experimental  approach  designed  to 
elicit  listener  decisions  about  which  of  two  trained  exposure  sound 
patterns  is  subsequently  presented  as  a probe  in  a noise  background. 

The  probe  is  slowly  brought  out  of  the  noise  until  the  listener  has 
sufficient  information  to  make  a decision.  The  signals  used  were 
either  marine  sounds  or  laboratory-generated  sounds  which  have 
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features  in  common  with  the  natural  ones.  The  two  sounds  diftered  in 
the  presence  or  absence  of  a recognizable  component  or  ieature.  The 
experiment,  therefore,  deals  with  the  performance  of  listeners  in 
distinguishing  between  two  sounds  which  differ  by  virtue  cf  a 
dichotomous  feature. 

A model  of  the  discrimination  performance  m terms  of  a simple 
h>pcthesis  test  about  the  presence  of  the  salient  feature  is 
presented  in  Chapter  II.  There,  the  case  of  a dichotomous  feature 
which  is  a band-limited  noise  is  treated  in  greater  detail  making 
use  of  signal  detection  concepts  and  Bayesian  hypothesis  testing. 

The  concept  is  then  expanded  in  Chapter  III  to  the  decision  process 
involved  in  sequential  hypothesis  tests  where  the  listener  has  the 
option  tc  defer  the  terminal  decision.  It  Is  seen  that  the  criterion 
the  listener  uses  is  a very  significant  lactcr  in  the  decision 

The  design  cf  the  experiment  is  discussed  in  Chapter  IV. 
Pre-recorded  test  sequences  were  used,  and  the  subjects  used  either 
an  A/B  response  recorder  to  signal  a terminal  decision  or  a 
continuous  rating  scale  tc  report  their  degree  or  contidence  and 
tentative  disc rimination  decision  as  a function  cf  time.  Then,  in 
Chapter  V,  the  raricnale  for  choosing  certain  sound  patterns  is 
explained.  The  decision  to  use  marine  sounds  was  based  primarily 
on  the  fact  that  such  sounds  are  of  known  significance  in  auditory 
discrimination,  have  many  characteristics  in  common  with  sounds 
occurring  in  various  industrfal  settings,  and  were  expected  to  be 
familiar  tc  the  Navy  personnel  who  were  tc  serve  as  subjects  in 
some  of  these  experiments.  The  dichotomous  features  explicitly 
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addressed  were  bands  of  noise  and  temporal  shifts  in  amplitude. 

Chapter  V also  describes  the  training  of  subjects  and  controls 
exercised  to  insure  listener  safety. 

The  discrimination  of  sound  patterns  with  bands  of  noise 
dichotomous  was  one  of  the  principal  topics  of  mvestigar ion 
Experimental  data  collected  by  others  about  the  detectability  of  noise 
bands  provides  the  basis  for  comparing  a model  of  the  discrimination 
task  as  one  of  feature  extraction  and  detection  to  the  observed 
results.  The  data  is  presented  in  Chapter  VI  along  with  analyses  of 
the  results  from  the  point  of  view  of  signal  detectability.  The 
results  show  that  the  simple  model  is  adequate  for  predicting 
performance  m some  of  the  experiments  using  complex  marine  sounds. 

The  model  must  be  extended,  however,  whenever  the  experiment  is 
designed  to  elicit  discrimination  decisions  with  the  probe  signal 
pattern  for  which  the  dichotomous  feature  is  absent. . For  this 
situation,  other  auditory  clues  must  be  used  in  the  decision  process. 

Experimental  results  for  sound  patterns  which  differ  in  their 
temporal  char acr er istic s are  presented  in  Chapter  VII.  Ihe  dichc tcmous 
feature  is  in  that  case  the  time  varying  amplitude  of  the 

waveform.  This  dichotomous  feature  also  resulted  in  listener 
discrimination  performance  which  can  as  a first  approximation  be 
explained  as  a feature  detection, 

The  experimental  technique  used  does  introduce  some  procedural 
biases  which  must  be  taken  into  account  whenever  the  results  are 
analyzed.  These  biases  and  anomalies  are  discussed  in  Chapter  Vlll. 
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One  oi  the  most  difficult  interactions  to  take  into  account  in 


experiments  as  this  one  is  that  of  time  on  the  memory  of  the  listener 
and  also  on  his  criterion.  There  is  strong  evidence  to  indicate  that 
even  trained  Navy  sonar  operators  find  it  difficult  to  maintain  a 
constant  decision  criterion  in  tests  where  the  signal-to-noise  ratio 
is  slowly  increased  with  time.  The  results  of  continuous  rating 
experiments  is  presented  in  Chapter  VIII  as  a means  for  mapping  out 
the  receiver  operating  characteristic  (ROC)  in  a sequential 
discrimination  task.  The  effects  of  detectability  of  the  dichotomous 
feature  and  listener  confidence  are  seen  to  yield  results  consistent 


with  expectations. 


CHAPTER  II 


THE  AUDITORY  RECOGNITION  TASK 

2 . 1 General 

In  this  chapter,  the  problem  pcsed  by  the  need  of  a listener  to 
recognize  an  auditory  stimulus  is  investigated.  Such  recognition 
implies  that  the  listener  has  first  somehow  been  convinced  of  the 
presence  of  a sound  source  and  must  now  decide  which  of  a number  of 
possible  classes  of  sounds  the  source  represents.  In  practice,  the 
processes  cf  detection  and  classification  are  closely  related.  This 
relationship  will  be  seen  to  be  especially  intimate  when  the 
classitication  reduces  to  distinguishing  between  two  sounds  which 
difrer  in  certain  precisely  defined  ways.  Also,  in  most  cases  or 
practical  interest,  the  sound  will  be  heard  in  the  presence  of 
exr -anecus  noises  which  will  tend  to  mask  the  presence  cf  the  sound 
of  .merest  as  well  as  to  oontound  the  recognition  or  classitication 
task 

figure  1 shows  the  essential  elements  of  the  recognition  task 
in  a concise  way  using  a general  estimation  model  (Van  Trees,  1968) 
modified  to  include  some  concepts  from  pattern  recognition  (Fu,  1968). 
The  model  consists  ot  the  following  four  components: 

a.  Source  Characterization.  An  acoustic  source  is  characterized 


by  a set  ot  parameters.  These  parameters  a define  a 
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multi-dimensional  parameter  space  V and  they  may  be 
associated  with  such  features  of  the  sound  as  the  presence 
of  a band  of  noise,  amplitude  modulation,  cr  temporal  shifts 
in  the  sound's  spectrum.  Denote  the  acoustic  pattern 
corresponding  to  acoustic  source  3 as  u.^  and  the  features 
as 

ij 

b.  Observation-  The  source  parameters  map  into  an  observation 


space  0 according  tc  some  probabilistic  law.  The 
observable  R is  the  listener's  response  to  the  stimulus. 


This  observation  includes  the  effect  of  the  auditory 
transducer  as  well  as  subsequent  neural  and  higher  level 
cognitive  processes. 

c.  Feature  Extraction.  It  is  hypothesized  that  the  listener 
perceives  the  set  ot  observables  as  features  (Zhukov  and 
Chr istovitch,  1974).  These  subjective  features  may  net  be 
equivalent  to  features  above,  however,  and  can  be  the 

result  of  physiology  cr  cognition  influenced  by  listener 
experience  and  training  (Plomp  and  Smcoienburg,  1970; 

Reed,  1973;  Ahumada  et  al.,  1975  •.  An  N dimensional 
observation  space  reduces  tc  an  i dimensional  leature  space. 

d Classification  Rule.  It  is  further  hypothesized  that  the 
listener  will  establish  a measure  of  similarity  between 
the  set  of  observed  features  comprising  a perceived  pattern 
with  patterns  previously  learned.  On  the  basis  ci  this 


measure,  an  identification  of  the  sound  results  by  applying 
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classification  cules.  Denote  the  classification  resulting 
from  the  observables  fi(R). 

In  most  listener  classification  tasks,  the  sound  patterns 
and  hence  parameters  a occur  with  some  a priori  probability.  The 
probability  of  occurrence  of  various  sound  patterns  strongly  influences 
the  classification  problem  by  restricting  the  number  ot  patterns  which 
rhe  listener  expects  to  hear  and  by  effecting  the  classification  rule. 

2 2 Patterns  with  Dichotcmcus  Features 

as  a subset  of  the  general  classification  model,  consider  the 
case  of  two  signal  patterns  only  The  task  then  reduces  tc  one  of 
deciding  between  two  previously  learned  patterns  on  the  basis  ot  the 
observation.  Such  a decision  may  result  when  a lathe  operator  must 
decide  between  the  sounds  of  a properly  adjusted  machine  and  one  for 
which  ’he  cuf r ing  rate  is  excessive.  In  more  complex  classification 
tasks,  it  may  be  practical  tc  arrive  at  a final  classification  as  the 
tester  oi  twc-way  choices  defining  a decision  tree  (Reed,  1973). 

Whenever  the  two  sounds  differ  by  the  absence  cl  specili; 
features  in  one  case  where  they  occur  in  the  other,  the  features  are 
dichotomous.  It  only  a single  dichotomous  feature  fi..  distinguishes 
two  scu^d  patterns  uJ, , the  listener  is  faced  with  the 

following  choices: 

a.  When  the  observation  is  thought  to  be  associated  with  the 
presence  cf  feature  say  that  signal  pattern  u 


is  heard. 
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b.  Else  conclude  that  signal  pattern  is  heard. 

Stating  the  above  decisions  in  terms  ol  statistical  hypothesis 
tests,  define  the  two  hypotheses: 


o: 

Feature 

fl4 

1 

is  absent,  decide 

pattern  u)_ 

r 

Feature 

fti 

is  present,  decide 

pattern  w.  . 

In  the  limit  where  sound  pattern  oj^  consists  of  a single 
feature  51  , the  discrimination  problem  reduces  to  that  of  detection 
or  the  featu'e.  While  the  detection  problem  is  certainly  or  interest, 
this  thesis  is  concerned  with  the  classification  or  discrimination 
task  instead.  The  detection  problem  has  been  treated  elsewhere 
(Gteen,  1960a;  Gundy,  1961;  Stallard  and  Leslie,  1974;  Swets,  1964), 
buf  it  :s  instructive  to  state  seme  ot  the  applicable  concepts  here. 

It  is  reasonable  tc  suspect  that  the  acceptance  of  hypothesis 
Hj  above  requires  at  least  an  opportunity  for  the  listener  to  detect 
the  presence  cf  the  dichotomous  feature  ft  . It  will  be  seen  in 
Chapters  V.  and  Vll  that,  in  some  experimental  cases,  the  detection 
cf  a dichotomous  reafute  appears  to  be  an  adequate  basis  tor  a 
classifi  aticn  decision. 

2 3 The  Theory  or  Mgnal  Detectability 

The  theory  ot  signal  detectability  is  a mathematic  el  model  ot 
the  mapping  cf  a set  of  stimuli,  consisting  either  ot  noise  alone  or 
from  signal  plus  noise,  into  a two-point  decision  space:  "no"  the 

signal  was  rot  contained  in  the  input,  or  "yes"  the  signal  was 
contained  in  the  input  (Tanner  and  Sorkin,  1972).  While  these 
models  ate  not.  intended  to  be  descriptions  of  the  way  a human 


* 


observer  makes  this  binary  decision,  they  are  normative  models  with 
which  the  observer's  performance  can  be  compared. 

The  models  comprising  this  theory  are  applicable  to  any 
receiver  operating  on  a set  of  inputs  and  as  such,  do  not  take  into 
account  the  properties  of  the  human  per  se.  A number  of  workers 
have  successfully  demonstrated  the  utility  of  the  theory  of  signal 
detectability  of  visual  as  well  as  auditory  stimuli  (Green  and  Swets, 
1966;  Peterson,  Birdsall  and  Fox,  1954;  Swets,  1964).  It  is  generally 
found  that  the  human  performance  falls  short  of  that  predicted  for  an 
ideal  detector. 

When  applied  to  detection  of  marine  sounds,  the  performance  with 
broadband  noise  applies.  This  is  due  to  the  fact  that  marine  sources 
radiate  a broadband  spectrum  with  perhaps  some  significant  tonals 
(Kinsler  and  Frey,  1962).  Industrial  noise  sources  have  similar 
characteristics  (Ward  and  Fricke,  1969) . The  problem  of  detecting 
the  presence  of  various  broadband  components  reduces  to  a test  of 
hypotheses  about  signals  which  are  only  known  statistically.  That 
is,  the  exact  phases  of  the  components,  their  instantaneous 
amplitudes,  bandwidths,  etc.,  are  at  best  known  in  terms  of  their 
averaged  power  spectra.  The  theory  of  signal  detectability  for 
broadband  features  leads  to  a likelihood  ratio  test  for  signals 
known  statistically.  If  we  further  assume  that  the  noise-like  sound 
can  be  modeled  as  a collection  of  samples  with  identical  Gaussian 
probability  densities,  the  ideal  processor  is  an  energy  detector 
(Van  Trees,  1968). 
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In  accordance  with  the  sampling  theorem  in  the  time  domain,  a 
band-limited  signal  of  bandwidth  W existing  over  a time  interval  T 
can  be  exactly  determined  by  2WT  samples  (Shannon  and  Weaver,  1949). 
These  samples  can  be  chosen  to  be  statistically  independent  if  the 
process  is  Gaussian  (Lathi,  1968).  The  likelihood  ratio  test  is  then 
given  by 

U 

2WT  1 

X(R)  - l K2.  '<  Y « L(R2)  , (2.1) 

i»l  1 Ho 

where  the  threshold  y includes  all  sample  invariant  factors  as  well 
as  the  criteria  (Van  Trees,  1968).  It  is  seen  that  the  likelihood 
ratio  test  consists  of  computing  the  sum  of  squares  of  the 
statistically  independent  sample  data  points  and  comparing  this  to  an 
appropriate  threshold.  Since  the  R^s  aie  Gaussian  random  variables, 
the  sufficient  statistic  X(R)  Is  a random  variable  with  a gamma 
distr ibution 

P (X>  = ~ a”  x d-le-X'b  > 0 , ,x  <00  ^ (2.2) 

I'(d)6d 

The  parameters  of  the  gamma  distribution  correspond  to: 

d - 2 - WT,  8 = 2 cf  (Hogg  and  Craig,  1970). 

n 

He  parameter  is  given  by: 

a2  - c'  under  H_  if  R is  n(0,  C?) 
h n 0 n 

o'  = o?  4-  p2  under  H,  where  R is  n(0,  a2  + o2) 
h n s 1 ns 

The  notation  n(p,  o')  Is  used  to  denote  a random  vat  table  with 
Gaussian  probability  density  which  has  a mean  y and  a variance  a2 


13 


\ 


Equation  (2.2)  reduces  to  that  of  a chi-square  density  function  with 
v degrees  of  freedom  whenever  0^=1. 

Furthermore,  whenever  the  number  of  degrees  of  freedom 


! : 


' 


V = 2WT  > 100  , 

2 

the  random  variable  x(k  ) can  be  closely  approximated  by  a Gaussian 
density  with  the  following  parameters  under  and  (Abramowitz 

and  Stegun,  1964): 


PXl»o 

PXlHj 


(X | H-)  is  n (2WT  a2,  4WT  Cu) 

0 n n 

(X | H. ) is  n [ 2WT  (o2  + o2),  4WT  (o2  + a2)2)  . 

1 ns  ns 


(2.3) 

(2.4) 


The  performance  of  the  hypothesis  test  can  be  completely 
characterized  by  the  quantity  d'  (Van  Trees,  1968)  The  detectability 
d'  is  defined  as  the  distance  between  means  on  the  two  h>potheses  when 
the  variance  has  been  normalized  to  unity  (Swets,  1964)  An  identical 
definition  is 


(d’) 


2 


(E(X|Ht)  - E(X | Hq)  ) 2 
1 '2  (Vat (XjHj)  + Var (X]Hq) ] 


(2.5) 


where  the  variances  under  the  two  hypotheses  are  different. 

Substituting  the  mean  values  E(X|H  ),  E(X|Hq)  and  variances 
Var(X|H^),  VarlXlH^)  icr  the  Gaussian  signal  case,  obtain: 


r 


_ 


,i  1 " - . J:- 
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d' 


!'  ^WT)*  a“ 

i s 

{ 4WT  ou  + 4WT  a*  0*  -t  2WT  c" 
n n s ■ s 


1 1/2 
I 

f 


d’  = (WT) 

opt 


1/2 


o'[l/'2(o7o') J + 
nL  s n 


(a 


^TcT 


i] 


1/2 


(2.6) 


where  the  notation  d'  has  been  introduced  to  indicate  theoretically 

cpt 

optimal  performance  in  the  hypothesis  test.  The  human  observer  will, 
in  general,  not  attain  this  level  of  performance  (Green  and  Swets, 

1966) . 

For  the  small  signal-to-noise  case  where 


cr/o2  « l , 

s n 


Equation  (2  6)  reduces  tc, 


d'  = (WT)1^  o‘ / c' , o2.o"  <s  l 

cpt  s n s n 


12.7) 


2 . 4 Discrimination  of  Complex  Patterns 

The  aboi/e  results  for  the  detectability  cf  a band  of  noise  with 

bandwidth  W is  new  extended  to  the  discrimination  task.  Whenever 

two  scund  patterns  differ  by  a single  feature  and  this  feature 

is  a band-limited  noise,  feature  ft  has  an  associated  detectability 

d’  given  by  Equation  (2.6).  An  equivalent  measure  of  detectability 
opt 

can  in  theory  be  written  tor  all  features  which  distinguish  the  two 
sound  patterns 

Although  the  distinguishing  features  in  any  real  discrimination 
task  may  not  be  dichotomous,  the  sound  patterns  will  differ  in  the 
relative  detectabilities  cl  the  features.  Also,  whenever  a number  of 
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features  comp-ise  a sound  pattern,  it  can  be  expected  that  some 
features  will  occur  only  when  others  are  present.  The  features  are, 
therefore,  correlated  one  to  another  to  some  extent. 

The  general  discrimination  or  recognition  task  for  everyday 
sound  patterns  is  orten  one  involving  multiple  interacting  featu-es 
with  a range  of  detectabilities.  The  analysis  of  the  general  task  is 
extremely  complex  for  this  reason.  This  thesis  deals,  therefore,  with 
a rranageabie  subset  of  such  scund  patterns. 

Even  in  the  simple  case  of  a few  dichotomcus  features  with 
controlled  interactions,  it  can  at  the  outset  only  be  assumed  that 
the  detectability  cl  the  features  will  in  seme  way  be  related  to  the 
discr iminat icn  performance. 

The  experimental  te-hmque  discussed  in  later  chapters  was 
designed  to  test,  how  accurately  a model  ct  the  discrimination 
periorir.ance  based  cn  statistical  hypothesis  tests  on  dichotcmous 
teatu'es  agrees  with  human  performance.  Equation  (2.1)  tot  the 
likelihood  test  shows  that  the  acceptance  ct  hypothesis  depends 

on  a threshold  r as  well  as  cn  the  detectability  oi  the  signal 

f 

The  performance  cl  the  idtor  discriminator,  and  presumably  that  of 
the  subopt imal  huroar;  observer,  can  be  changed  by  readjusting  the 
threshold.  The  threshold  is  determined  by  the  decision  rules  end 
takes  irtc  account  the  observer's  knowledge  about  the  relative 
t-equency  . cur  retire  ct  the  signal  pattern  with  the  teatute 


preset  (Egan,  19  ’51. 


CHAPTER  III 


SEQUENTIAL  RECGGNIIICN 

3 . 1 Deterred  Decisions  m Recognition 

In  many  reccgn.ticn  tasks  c£  practical  interest,  the  confcunding 
ettect  of  extraneous  noise  precludes  an  immediate  classi tication 
decisicn.  For  disci lmmaticn  between  signals  with  dichotomous 
ieatures,  this  deferral  ot  a decision  is  equivalent  to  sequential 
hypo'hesis  testing  (Wald,  1 9 A 7 ) 

Ai  any  time  t , the  listener  is  comrcnted  with  three 
possible  decisions: 

a-  On  the  basis  ct  the  aural  stimulus,  decide  H^,  the 
tearure  Q is  absent. 

x 

b.  Given  ’■he  aural  stimulus,  decide  H,  , rhe  feature  12. 

1 l 

is  present 

c.  Deter  the  c lass  it  ic at  ion  de.isien  because  there  is 

ir.sut  t ic  lenr  evidence  tc  decide  either  cr  . 

lie  listener  will  hcose  his  course  cl  action  depending  both  on 
the  observables  and  on  1 is  mental  set  (Leeper,  1951).  Each  cl  the 
above  cruises  ot  action  has  an  associated  payoft  or  risk  which  depends 
cn  the  si'uation  in  which  the  discrimination  is  to  be  made.  For 
example,  the  risk  associated  with  a decision  to  stop  a machine  which 
involved  in  an  assembly  line  is  greater  than  is  that  associated 


rr 
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with  the.  decision  to  stop  a machine  tool  not  integral  to  a process 
nvolving  many  processing  steps.  The  decision  to  stop  the  machine 
is  here  assumed  to  be  the  result  of  an  aural  indication  of  abnormal 
performance.  The  listener's  knowledge  of  the  probability  of 
occurrence  of  signal  patterns  also  is  significant. 

Formally,  whenever  the  a priori  probabilities  cf  the  signal 
patterns  are  known,  and  a cost  can  be  associated  with  the  available 
courses  cf  action,  the  Bayes  criterion  can  be  used  to  minimize  the 
total  risk  (Van  Trees,  1968).  For  the  sequential  hypothesis  testing 
problem,  the  significant  parameters  are: 

a P^,  P^,  the  a prion  probabilities  of  u),  , u^, 
r especrively. 

b C , 1,  j = 1,  2 where  C..  is  the  cost  oi  deciding 
ij  ij 

in  tavor  of  signal  pattern  uu  when  signal  pattern  ui 
is  correct. 

c.  C , . , the  cost  of  deferiing  the  decision, 
det  ° 

Alternately,  if  the  a priori  probabilities  are  net  known,  cr 
if  it  is  net  reasonable  to  assign  costs  to  each  cf  the  courses  of 
action,  t may  be  more  apprcpiiate  to  set  a criterion  which  reduces 
the  probability  cf  a particular  error  nelcw  some  limit  The 
thresh  Id  adopted  in  this  case  will  be  chosen  using  rhe  Neyman- 
Pearson  tritericn 

Whatever  rhe  criterion  by  which  the  thresholds  tor  the 
available  ccarses  of  action  are  determined,  the  observed  performance 
will  be  strongly  lnlluenced  by  this  threshold.  A way  tc  characterize 
this  performance  under  various  criteria  is  by  means  of  a receiver 
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operating  characteristic  (ROC)  curve  (Fgan,  1975).  This  curve 
presents  the  probability  of  a detection  P(D)  vs  the  probability  of 
a false  alarm  P(FA).  As  the  listener  adjusts  his  criteria,  both  of 
these  probabilities  will  change. 

Under  a lax  criterion,  i.e.,  say  whenever  there  is  any 

evidence  of  the  presence  of  feature  the  number  of  correct 

discriminations  will  be  high  as  will  the  probability  of  a false 
classification.  A more  strict  criterion  will  decrease  both  of  these 
probabilities. 


3.2  Factors  Influencing  the  Terminal  Decision 

Of  the  possible  decisions  in  the  sequential  classification 
task,  a decision  to  accept  hypothesis  and  is  referred  to  as 

a terminal  decision.  The  way  such  a terminal  decision  is  made  in  a 
detection  problem  has  been  treated  elsewhere  and  will  not  be 
reiterated  (loeb  and  Binford,  1970),  The  concept  is  extended  to  a 
classification  task  below. 

Consider  the  sequential  classification  as  consisting  of  a 
series  of  hypothesis  tests  corresponding  to  dichotomous  features. 

Each  such  test  constitutes  a detection  opportunity  for  feature 
If  the  hypothesis  and  both  have  finite  probability  of 

occurrence,  it  is  only  a question  of  time  until  a terminal  decision 
will  occur  by  chance  alone. 

Wald  (1947)  extended  the  concept  of  the  likelihood  ratio  test 
to  the  sequential  problem.  The  sequential  probability  ratio  test  is, 
at  the  i'th  stage. 
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x Pr|Hl  (VV  H0 

<R>  'jli  Pr|H0  <Rj|H0> 


, A < B 


(3.1) 


else  defer  the  decision. 

Here  A and  B aie  the  thresholds  for  deciding  H^  and  H^, 
respectively  He  gees  cn  to  prove  that  such  a test  will  always 
terminate,  and  he  derives  the  expected  number  of  observations  for  a 
reiminai  decision 

Ihe  definition  ct  a detection  opportunity  must  be  made  very 
pre- isely  it  a relationship  between  signal  detection  theory  and 
operator  performance  is  to  be  expected.  Furthermore,  the  sequential 
probability  ratio  test  assumes  that  the  probability  densities  under 
Hq  and  remain  unchanged  trom  observation  to  observation  The 

actual  derect  cn  occurs  in  a situation  where  the  relationship  ct 
extraneous  noise  tc  scund  of  interest  will  normally  change  as  a 
function  ci  time  The  c lass i t icat ion  rules  are  also  likely  tc  change 
:r  e -eal-wo. Id  situation.  If,  tor  example,  the  time  to  ceme  tc  a 
decision  is  very  long,  the  listener  may  relax  the  criterion  ter  the 
sake  cf  ending  fhe  uncertainty. 

3 3 Forced  Response  vs  Sequential  Classification 

C lassli icaT lens  which  imply  only  the  detection  cr  one  dichctcmcus 
feature  may  be  conveniently  visualized  by  the  probability  densities 
under  and  H^.  Whenever  the  listener  must  either  make  a yes-no 

(VN)  or  a leered  ch  ice,  two  alternative  (2aFC)  response,  a single 
threshold  divides  rhe  decision  space  into  twe  distinct  regions. 

Figure  2a  shews  the  probability  densities  and  erior  regicns  tor  this 
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situation  fcr  the  detection  of  a band  of  noise  in  noise.  The  error 


regions  are: 


a.  a = P (deciding  hJHq)  = P(FA)  = / p^  (X|Hq)  dx  (3,2) 

Y 

b B = P (deciding  HqJHj)  = 1 - P(D)  = / p^  1X|H1>  dx  .(3.3) 


The  error  regions  at  time  for  sequential  detection  are 

shown  in  Figure  2b  The  probability  of  an  error  in  classification  will 
be  given  by: 


P(E)t  - P(H0)  P(X|H0)r  + P(H1)  P(X|H1)t 


Qr  P(HQ)  + 6t  P(H1)  , 


\ *•  : PjH  (XIHQ)  dx,  Bt  = / p^,H  (X  i H1 ) dx  , (3.5) 

8 


In  practice,  the  criterion,  hence  threshold,  will  te  set  by  the 
listener  in  response  to  the  situation.  The  numeric  value  of  the 
threshold  T wi.l  not  be  known  but  can  m the  forced  respcnse  case 
be  inferred  from  the  observed  values  of  P(FA)  and  P(D)  The 
sequential  detection  and  classlt ication  case  presents  some 
difficulties,  however  Especially  when  the  distributions  under 
and  H,  are  changing,  knowledge  of  ix  and  Bf  is  it  sell  net 
adequate  tc  define  y and  y without  assumptions  of  the  terminal 

de'  i sioti  ptccess- 
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The  experimental  approach  described  in  Chapter  IV  is  designed  to 
elicit  information  about  classification  performance  when  the  criterion 
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CHAPTER  IV 


DESIGN  OF  AN  EXPERIMENT  IN  AURAL  RECOGNITION 


4 . 1 Intrcduc t ion 

This  chapter  presents  the  rationale  tor  the  design  of  an 
experiment  in  aural  recognition.  The  experiment  incorporates  some 
of  the  methods  used  in  classical  threshold  techniques.  Also,  the 
data  are  to  be  analyzed  from  the  point  of  view  of  signal  detection 
theory.  The  differences  between  the  threshold  techniques  and 
experimental  methods  used  by  researchers  in  signal  detectability 
are  discussed.  The  methods  are  compared  and  an  experimental 
technique,  here  called  the  "modified  threshold  procedure,"  is 


desc  r ibed 


In  late.c  sections  of  the  chapter,  an  extension  of  the  technique 
to  a confiden-e  rating  scheme  is  described.  This  latter  approach  is 
designed  tc  map  out  the  functional  relationship  between  confidence 
level  and  operator  performance. 

The  safeguards  exercised  in  experiments  using  subjects  are 
detailed  in  Section  4.6.  The  calibrations  and  loudness  controls  are 
als-:  described  in  that  section. 


been  investigated  by  threshold  methods  (Boring,  1950)  In  this 
approach,  rhe  stimulus  paiametei  to  be  measured  is  changed  slowly 
until  a just  perceptible  difference  in  the  physiological  response 
occurs  (Licklider,  1951).  One  related  area  where  this  technique  has 
been  used  extensively  is  in  the  Investigation  of  the  Icudness  of 
various  spectrally  shaped  noises  (Cremer,  Plenge,  and  Schwarzl,  1959; 
Zwicket , 1958;  Zwislocki,  1969). 

The  results  of  these,  threshold  experiments  can  be  explained 
using  rhe.  theory  of  signal  detectability  (Treisman  and  Watts,  1966). 

It  can  be  inferred  that  the  listener  In  such  experiments  operates  in 
a sequential  detection  or  recognition  environment.  Whene’er  rhe 
established  criterion  is  exceeded  during  a period  of  increasing 
st  me  ..us  intensity,  the  threshold  can  be  thought  to  have  been  exceeded. 

It  sh  aid  be  pointed  out,  however,  that  single  cr  multiple 
thresholds  i r sensual  inputs  have  been  postulated  as 'explaining  the 
observed  esults-  Also,  quantization  of  the  auditory  process  has 
been  suggested  (Licklider,  1951)  While  this  thesis  is  not  concerned 
with  the  analysis  of  the  detailed  mechanisms  which  are  active  in  the 
human,  there,  a-e  fundamental  differences  in  approach  between 
experiments  using  threshold  techniques  and  those  designed  to  test 
various  aspects  ol  the  signal  detection  theory. 

Detecti'-n  theory  related  experiments  are  designed  to  elicit 
either  data  about  rhe  RCC  curve  or  the  psychometric  tunction  ct  the 
listener-  The  psychometric  function  is  the  variation  of  pet  cent 
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correct  detection  or  classification  as  a function  of  stimulus  level. 

The  key  parameters  are  usually  the  various  probabilities  of  the 

available  courses  of  action  and  the  detectability  d'  expected  on 

opt 

the  basis  of  some  signal  detection  model.  The  detectability  is 
usually  defined  as  the  normalized  difference  of  means  of  a statistic 
under  two  alternate  hypotheses. 

The  signal  detection  theory  types  of  experiments  present  the 
stimuli  tc  a listener  at  seme  signal-to-noise  ratio  and  observe  the 
performance.  The  criterion  is  usually  specifically  included  by  way 
of  instructions  prior  to  the  test.  Data  are  taken  at  a number  of 
signal-to-noise  ratios  cr  other  fixed  values  of  a parameter  (Swets, 
1964).  For  YN  type  experiments,  the  responses  are  either  "yes"  the 
signal  was  present  in  a trial  period,  or  "no"  it  was  not.  In  forced 
choice,  N alternative  (NAFC)  experiments,  the  listener  indicates  in 
which  one  of  N trial  periods  the  signal  occurred.  The  signals  are 
usually  presented  in  the  presence  of  an  interferring  noise  at  some 
preselected  level. 

The  YN  or  NAFC  experiments  have  the  advantage  that  the  results 
are  subject  tc  well  known  and  easily  applied  statistical  hypothesis 
resting  techniques  as  are  described  in  Chapter  II  (Egan,  1975), 

4 3 A Pilot  Stud^  and  Its  Impact 

Initially,  a pilot  study  was  conducted  using  University  students 
as  subjects  to  identify  weaknesses  in  experimental  techniques  and  to 
obtain  bounds  on  listener  performance.  This  was  an  identification 


task  in  which  the  listener  was  asked  to  decide  which  of  two  marine 


sounds  (che  exposure  set)  was  subsequently  presented  in  a broadband 
ncise  background.  The  signals  were  presented  binaurally  from  tape 
recordings  in  an  experimental  arrangement  very  similar  to  that  which 
is  described  in  later  sections.  However,  for  these  initial  tests, 
the  unknown  cr  probe  stimulus  was  presented  for  a 20-second  period  of 
time  at  a fixed  signal-to-noise  ratio.  The  listener  was  asked  to 
record  cn  an  answer  sheet  his  or  her  assessment  of  which  member  of  the 
exposure  set  the  probe  corresponded  to.  In  addition,  the  third 
alternative  "don't  know"  was  to  b^  indicated  whenever  "reasonable" 
doubt  existed. 

Because  the  sounds  presented  do  not  fall  within  the  range  of 
normal  auditory  experience  for  these  listeners,  an  immediate  area  of 
difficulty  arises.  One  is  precluded  from  using  techniques  applicable 
to,  for  example,  speech  intelligibility  studies  or  other  recognition 
tasks  ter  meaningful  material  where  the  subjects  can  be  assumed  to 
have  a common  learned  ability  to  distinguish  between  stimuli  (Pollack, 
1948;  Olach,  1964).  A time  consuming  stimulus  training  period  is 
required  prior  to  any  probe  measurement.  Since  the  stimuli 
characteristics  cannot  be  reliably  verbalized,  an  arbitrary 
designation  ct  A and  B were  associated  with  the  trained  stimuli.  Such 
arbitrary  association  of  letters  with  the  members  of  the  exposure  set 
is  unfortunate,  However,  the  method  used  by  Pollack  (1959)  to 
circumvent  this  shortcoming,  the  method  of  recognition  memory,  cannot 
be  used  in  the  present  case.  In  the  method  of  recognition  memory, 
the  probe  is  chosen  from  an  augmented  set  which  includes  the  members 
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of  the  exposure  set  and  members  of  another  set,  the  confusion  set. 

Tb'  listener  response  then  consists  of  an  assessment  of  whether  the 
probe  in  fact  corresponded  to  a member  of  the  exposure  set.  The 
marine  sounds  are  such  that  the  experimenter  cannot  know  if  the 
confusion  signals  will,  in  fact,  sound  "like"  one  of  the  exposure 
signals. 

A test  event  consisted  of  an  exposure  (learning)  period  for  the 
two  signals  without  interfering  noise.  After  a short  pause,  the 
probe  was  presented  in  the  interfering  noise.  Subsequently,  the 
exposure  set  was  repeated  in  a refresh  period,  or  a new  set  of 
signals  was  exposed  for  learning.  No  feedback  was  used  except  that 
in  some  cases  the  subjects  were  appraised  of  their  performance  at  the 
end  cf  a test  session.  Each  test  event  requires  about  two  minutes 
to  complete  for  a total  of  15  to  18  events  per  session. 

Little  was  known  at  the  time  of  this  pilot  study  about  the 
range  of  signal-to-noise  ratios  at  which  the  listener  performance 
would  attain  some  pre-determined  probability  of  correct  response 
P(C)  It  was  necessary,  therefore,  to  present  the  prcbe  at  a number 
ci  values  of  signal-to-noise  ratios.  Using  four  signal  pairs  of 
interest  and  five  values  of  signal-to-noise  ratio,  there  were  20 
performance  indices  to  be  estimated.  As  will  be  shown  later,  at  least 
50  data  points  are  desired  for  each  performance  index  for  a total  of 
1000  events  or  about  60  sessions. 

The  availability  of  trained  listeners  was  limited,  and  not  all 


performance  indices  were  adequately  studied.  This  method  was 
abandoned  in  favor  of  the  ramped  signal-to-noise  ratio  technique 
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discussed  in  subsequent  sections  of  this  report  primarily  because  a 
faster  method  was  required.  This  early  study  was  very  useful  in 
identifying  several  weaknesses  in  procedures  and  equipment  design. 

A . A A Mod i f i e d Th res h old  Procedure 

Most  of  the  results  presented  in  this  report  were  collected  using 
a modification  of  threshold  procedures.  The  scheme  used  is  modeled 
after  the  techniques  used  by  Cremer  et  al.  (1959)  and  Zwicker  (1958) 
to  determine  the  loudness  of  various  bands  of  noise.  Dubrovskii  and 
Tumarkina  (1967)  also  used  an  experimental  technique  which  is  very 
similar  to  the  one  described  below.  Some  considerations  which 
strongly  influenced  the  experimental  technique  used  in  this  thesis 
were: 

a.  Because  it  was  desired  at  the  outset  to  make  the  listener 
related  portion  of  the  system  portable,  the  signals  are 
pre-recorded  on  magnetic  tape.  Also,  the  response  recorder 
is  of  necessity  simple. 

b.  It  was  desired  to  automatically  record  responses  without 
the  test  conductor  needing  to  be  present.  Subjects  were 
thus  able  to  conduct  tests  at  their  own  discretion  with  a 
minimum  of  scheduling  difficulties. 

c.  The  process  of  data  reduction  was  to  be  a simple  one  with 
a minimum  of  manual  data  handling. 


In  the  modified  threshold  procedure,  the  probe  is  initially 
presented  in  a broadband  noise  background  at  a low  signal-to-noise 
ratio.  This  signal-to-noise  ratio  (SNR)  is  then  increased  step-wise 
with  time  until  such  time  as  the  listener  can  make  a determination  of 
which  exposure  signal  the  probe  corresponds  to.  This  method  has  an 
advantage  in  its  similarity  to  the  sequential  classification  task 
encountered  in  many  real  situations.  However,  the  results  only 
provide  data  about  the  probability  of  correctly  classifying  at  the 
SNR  needed  for  a high  confidence  terminal  decision 

The  sequence  of  occurrences  during  each  test  event  is  shown  in 
Figure  3-  The  exposure  signals  A and  B are  chosen  with  randomization 
from  the  set  of  auditory  patterns  of  interest.  Generally,  these 
patterns  differ  by  one  or  more  features.  The  initial  and  refresh 
exposures  are  of  fixed  dutation  which  the  listeners  generally  agreed 
on  as  sufficient  for  later  recognition.  The  prcbe  SNR  is  always  low 
initially,  but  'his  value  is  randomized  to  avoid  listeners  responding 
cn  the  basis  of  time  instead  of  the  perceived  aural  response. 

Responses  were  indicated  by  listeners  by  pressing  either  a 
switch  marked  A or  one  marked  B.  Once  the  listener  indicated  a 
decision,  the  auditory  signal  was  blanked  for  the  remainder  of  the 
response  period.  The  duration  of  the  response  period  was  also 
randomized.  By  not  knowing  when  the  response  period  would  end,  the 
listeners  were  in  effect  assigned  a cost  for  continuing  in  the 
listening  state  It  was  found  that  subject  performance  could  be 

established  by  means  of  verbal  instructions  prior  to  each  test 
session.  Once  trained,  the  subjects  maintained  a consistent  criterion 
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which  was  described  in  the  instructions  as  "reasonably  certain." 

The  pre-recorded  signals  were  played  back  using  a Crown  700  tape 
player  and  a pair  of  Telephonies  Model  TDH-39  dynamic  earphones.  The 
signal  was  fed  m phase  to  a matched  set  of  phones  installed  in  an 
audicmetric  beeth  where  practical.  Tests  done  cn-site  using  Navy 
personnel  were  conducted  in  the  quietest  classroom  available  Figure 
diagrams  the  equipment  used  at  the  listener  location.  The  dual 
channel  audio  source  tapes  contain  a pilot  channel  for  response 
recorder  control  and  a channel  with  the  stimulus  signal.  This  latter 
tape  channel  is  recorded  with  the  exposure,  prebe,  and  interfering 
noise  signals.  Control  signals  recorded  on  the  pilot  channel  consist 
of  a 12. 5 kHz  reference  tone  during  the  exposure  period  and  a tene 
with  frequency  proportional  tc  SNR  during  the  response  period.  These 
control  signals  are  decoded  by  means  of  a phase  locked  loop  set  to 
12  5 k.H2  and  a level  detector.  Digital  logic  is  then  used  to  control 
the  state  of  the  response  recorder  and  of  a cassette  recorder , 

A Scny  cassette  recorder  is  used  to  log  responses  which  are 
ceded  as  frequencies  The  tene  proportional  to  SNR  is  also  output 
tc  the  cassette  during  the  response  period  prior  tc  the  subject's 
responding  During  the  exposure  period,  the  cassette  recorder  is 
stopped  so  that  a session  which  lasts  nearly  an  hour  requires 
considerably  less  cassette  time.  The  equipment  ar  the  listening 
si'e  is  pictured  in  Figure  5. 

Data  were  analyzed  in  twe  ways.  Initially,  the  cassette  tapes 
were  played  ba<- k cn  another  cassette  tape  unit  and  output  on  a cash 
-egister  tape  using  a printing  frequency  ccunrer,  The  SNR  at  the 
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moment  of  subject  response  and  the  classification  (A  or  B)  were 
detetmined  manually.  As  the  number  of  data  points  increased,  the 
results  were  key  punched  and  copied  onto  a computer  magnetic  tape 
for  retention  and  subsequent  data  reduction.  Later,  the  process  was 
automated  by  outputting  the  counter  frequency  directly  to  phase 
encoded  digital  tapes.  A buffered  Pertec  digital  tape  unit  was 
used  for  this  purpose.  These  IBM  compatible  digital  tapes  are 
amenable  to  direct  processing  by  computer. 

4 . 5 A Continuous  Rating  Experiment 

The  terminal  decision  logged  in  the  above  method  does  not  give 
enough  information  to  allow  separation  of  criterion  effects  from  those 
due  to  the  detectability  of  the  stimulus.  In  order  to  obtain 
information  of  how  performance  shifts  with  criterion,  a continuous 
rating  experiment  was  also  undertaken.  In  this  approach,  the  subject 
continuously  indicates  his  tentative  classification  (A  or  B)  as  well 
as  his  degree  cl  certainty.  The  responses  are  indicated  by  moving  a 
linear  potentiometer  control  from  side  to  side.  The  potentiometer 
positions  are  annotated  with  a numeric  scale  ranging  from  0 to  10 
as  is  shown  below. 

A B 

10  864202468  10 

The  listeners  were  instructed  to  attempt  a preliminary,  albeit 
lew  confidence  r lassif icat ion  as  soon  as  possible  and  to  modify  this 
decision  as  more  data  became  available.  As  the  SNR  increased,  they 
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would  tend  to  move  up  the  scale  tc  higher  confidence  levels.  These 
experiments  are  hereafter  referred  to  as  '’slide  wire"  tests. 
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Foi  the  slide  wire  tests,  a number  of  subjects  were  tested  at 
once-  Up  to  five  response  recorders  were  set  up  and  each  subject 
lccaticn  was  equipped  with  its  own  response  recorder  and  a pair  of 
Pioneer  SE205  circumauial  earphones.  Responses  were  output  to  a 
six-channel  strip-chart  recorder  along  with  a voltage  proportional 
to  SNR  This  latter  voltage  was  derived  from  the  pilot  channel  of  the 
dual  channel  audio  tape.  The  SNR  prcpcrtional  frequency  was  converted 
to  a voltage  by  means  of  a frequency  to  voltage  converter.  The 
subject  related  portions  of  the  equipment  used  in  slide  wire  tests  is 
pictured  in  Figure  6 

The  data  on  the  strip-chart  recorder  output  were  manually  reduced 
to  obtain  the  confidence  and  classification  decision  at  various 
s ignal-to-n: ise  radios  These  data  were  also  ultimately  reduced  tc 
computer  compatible  form 

4 6 listener  Safeguards  and  Calibration  Techniques 

Prior  to  beginning  experiments  using  human  subjects,  approval 
was  obtained  in  compliance  with  the  Institutional  Assurance 
provisions  of  The  Pennsylvania  State  University  For  approval,  a 
prospectus  of  the  study  was  submitted.  This  proposed  approach  was 
reviewed  by  a University  Committee  to  ascertain  that  no  danger  to 
the  listeners  wc uld  resol'  Specifically,  the  levels  and  duraticns 
of  signals  are  such  that  no  damage  risk  is  incurred  (American 
Academy  ci  Opt halmc logy , 1957;  Each  subject  was  instructed  as  to 
his  tele  as  participant  in  the  study,  and  it  was  pointed  out  that 


under  n:  conditions  would  the  sounds  be  sc  loud  as  to  cause  discomfort. 
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See  also  Appendix  A. 

In  order  to  eliminate  the  effect  of  shift  in  subjective  loudness 
as  the  probe  signal  tc  interfering  noise  ratio  changed,  a balanced 
mixer  followed  by  an  automatic  loudness  control  was  used.  The 
loudness  as  presented  to  the  subject  was  carefully  maintained  at  a 
loudness  level  of  65  phons  (GD)  (ISO  R532).  This  loudness  level  was 
verified  from  1/3-octave  band  measurements  of  the  voltage  function  to 
the  headphones  and  taking  into  account  the  factory  provided  earphone 
calibration  with  the  MX41/AR  ear  cushions.  This  loudness  level 
applied  to  the  exposure  signals  as  well  as  the  probe  signal  in  noise. 
The  automatic  lcudness  control  used  was  a relatively  simple  one 
consisting  of  an  automatic  level  control  with  B weighting  in  the 
feedback  (Peterson  and  Gross,  1963).  The  marine  sounds  used  were 
simple  enough  so  that  there  was  no  need  to  use  complex  circuits  for 
control  of  lcudness  as  are  needed  in  general  sound  broadcast  work 
(Bauer  and  Totick.,  1966) 

Since  it  was  not  possible  to  directly  calibrate  the  circumaurai 
earphones  used  in  the  slide  wire  experiments,  the  loudness  was 
adjusted  using  a loudness  balance  technique.  For  this  adjustment, 
the  same  noise  signal  was  fed  to  one  of  the  TDH-39  calibrated  phones 
and  to  one  of  the  Pioneer  phones  and  the  level  to  the  latter  phone 
was  adjusted  until  the  reference  signal  on  the  left  ear  sounded 
equally  as  loud  as  that  in  the  uncalibrated  phone  on  the  right  ear. 

The  positions  were  then  interchanged.  This  calibration  is  not 
especially  accurate  because  it  is  difficult  to  maintain  proper 
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pressure  on  the  TDH-39,  and  also  the  circumaural  phones  are  sensitive 
to  the  shape  of  the  pinna  and  the  subject’s  hairline,  However,  the 
absolute  calibration  need  not  be  very  precise  for  the  following 


reasons : 


a.  The  loudness  is  quite  constant  and  sufficiently  high  so 
that  all  signals  are  well  above  the  auditory  threshold 
and  where  the  ear  is  reasonably  linear, 

b.  Since  there  is  no  discomfort,  and  the  calibration  is 
probably  no  more  than  10  phons  in  error,  the  damage  risk 
to  the  listeners  is  never  significant. 

c.  Differences  between  units  of  the  TDH-39  and  Pioneer 
earphones  in  frequency  response  will  tend  tc  be  at  very 
low  and  high  frequencies.  These  are  frequencies  which  do 
not  affect  the  dichotomous  feature  perception  for  the 
scunds  of  interest. 

The  SNR  was  controlled  by  means  of  a precision  balanced  mixer 
with  step  increments  of  about  1/2  dB.  The  shaft  of  this  mixer  is 
connected  through  a reduction  gear  to  a potentiometer  which  varies 
the  voltage  tc  a voltage  controlled  oscillator.  This  frequency  was 
measured  at  each  step  of  the  mixer  and  used  as  input  tc  subsequent 
data  reduction.  The  combined  frequency  uncertainty  due  to  flutter 
and  wcw  in  the  Crown  tape  player,  the  Sony  cassette  recorder,  and 
the  final  cassette  player  was  less  than  + 1/4  of  the  frequency  shift 
resulting  from  a change  of  SNR  of  1/2  dB. 
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Each  of  the  two  exposure  signals  was  analyzed  and  plotted  against 
the  interfering  noise  background  using  1/3  octave  analysis  accurate 
to  1/2  dB.  This  calibration  was  performed  with  the  balanced  mixer 
set  to  2ero.  The  linearity  of  the  balanced  mixer  was  checked  with 
two  widely  spaced  tones  and  the  relationship  of  mixer  setting  to 
resulting  SNR  was  determined  and  served  as  a correction  in  subsequent 
processing 

Using  these  calibrations  and  correction  terms,  the  SNR  output 
to  the  tape  recorder  used  for  recording  the  primary  tapes  is 
therefore  known  for  all  combinations  of  background  noise  and 
balanced  mixer  settings.  Subsequent  recording  steps  were  also 
carefully  monitored  to  insure  that  the  dynamic  range  of  the  tape 
medium  was  never  exceeded.  These  steps  then  insure  that  the 
effective  SNR  presented  tc  subjects  could  be  accurately  determined. 

The  calibration  and  correction  curve  accuracies  are  such  that  SNR's 
can  be  determined  to  within  1/2  dB. 


CHAPTER  V 


EXPERIMENTS  IN  AUDITORY  PATTERN  DISCRIMINATION 
5. 1 Summary  of  Experiments 

The  two  techniques  described  in  the  previous  chapter,  modified 
threshold  and  slide  wire,  were  used  to  measure  the  auditory  pattern 
discrimination  for  noise-like  sounds.  Experiments  were  conducted 
ever  a period  of  about  a year  using  trained  University  student 
listeners  as  well  as  submarine  sonar  operators  with  various  degrees 
of  experience. 

This  chapter  describes  the  experiments  in  detail  as  to  signals 
used,  statistical  methods  applied  to  data  reduction,  and  subject 
training.  The  rationale  for  choosing  the  signals  used  in  the 
discrimination  tasks  is  presented  in  Section  5.2  along  with 
characterizations  of  the  marine  sounds.  Section  5.3  presents  the 
statistical  testing  techniques  used  and  then,  in  Section  54,  the 
steps  taken  to  train  the  listeners  in  this  paradigm  are  described. 

It.  is  argued  that  consistent  performance  was  attained  with  the  two 
populations  of  listeners  used. 

The  student  subjects  only  participated  in  the  modified 
threshold  experiments.  For  student  tests,  the  signals  were 
presented  under  calibrated  earphones  in  an  audiometric  booth  located 
in  the  Applied  Science  Building  of  The  Pennsylvania  State  University. 
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Seven  students  were  Involved  in  those  tests  and  they  were  concurrently, 
or  had  been,  participants  in  other  psychoacoustic  studies  at  the 
University.  There  were  three  advanced  standing  female  students  and 
four  graduate  school  males  with  normal  hearing  as  verified  by  current 
audiograms.  These  listeners  were  free  to  conduct  tests  at  their  own 
convenience  with  the  following  restrictions: 

a.  No  two  test  sessions  could  run  sequentially. 

b.  No  more  than  two  test  sessions  could  be  held  cn  one  day. 

The  students  averaged  one  rest  session  per  week. 

Submarine  sonar  operators  participated  in  both  slide  wire  and 
modified  threshold  experiments.  These  tests  were  conducted  in  the 
quietest  classroom  available  at  the  US  Naval  Submarine  Training 
Center,  Pacific,  San  Diego  detachment.  These  operators  tcck  part  in 
one  session  per  day  for  five  days.  Two  such  week-lcng  tests  were 
conducted  using  a total  of  12  sonar  operators  with  between  6 months 
and  20  years  experience.  During  these  tests,  the  authcr  cr  an 
assistant  were  present  at  all  times  when  tests  were  in  session.  All 
cf  these  operators  had  been  trained  for  passive  submarine  sonar 
tasks  and  all  had  some  experience  in  that  sertmg.  Those  ope 'ate  rs 
with  extensive  ship-board  experience  were  at  the  time  serving  as 
instructors  at  the  training  center. 

The  calibrated  TDH-39  phones  were  used  with  sonar  operators 
tor  modified  threshold  tests.  Although  attempts  were  made  to  find  a 
quiet  location  tor  the  tests,  some  rntertering  sounds  in  the  room 
resulted  trum  vehicular  ttafflc  outside.  Extensive  note-  were  taken 
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throughout  the  tests  of  such  inter rerences  and  suspect  events  were 
eliminated  from  the  data  later.  Several  operators  participated  in 
the  slide  wire  experiments  at  the  same  time.  The  Pioneer  phones 
discussed  in  Chapter  IV  were  used  for  these  tests. 

5. 2 Choice  of  Noise-Like  Signals 

The  objective  of  this  work  was  to  quantify  subjective  responses 
to  sounds,  especially  in  the  discrimination  between  similar  ncise-like 
stimuli.  To  achieve  this  objective  and  to  allow  the  use  of  sounds 
which  play  a significant  role  in  an  everyday  situation  of  interest, 
marine  sounds  with  very  well  defined  characteristics  were  chcsen. 

Since  submarine  sonar  operators  are  routinely  required  to  distinguish 
between  marine  sounds,  this  is  cne  area  where  discrimination  ability 
1s  known  tc  be  important.  Furthermore,  the  trained  sonar  operators 
constitute  a population  of  listeners  with  hopefully  comparable 
training  and  similar  work-a-day  environments.  The  marine  sounds  are 
also  ones  tor  which  discrimination  can  be  easily  trained  with 
experienced  listeners  as  are  the  University  students  used. 

Many  industrial  noises  have  characteristics  which  are  similar 
to  the  marine  sounds.  In  fact,  the  marine  sources  exhibit  broadband 
noise  spectra  which  are  caused  by  the  same  mechanisms  leading  to 
many  sounds  in  industry,  principally  hydrodynamic  sources  and  the 
composite  oi  many  independent  vibrating  components.  Kinsler  and 
Frey  (1962)  describe  the  radiated  sound  of  ships  as  resulting  from 
"cavitation,"  a hydrodynamic  mechanism,  and  from  "machinery,"  a 
source  common  to  most  industrial  situations. 
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The  stimuli  used  in  the  present  tests  were  of  twc  types.  The 
first  type  of  sounds  to  discriminate  were  actual  marine  sounds  which 
were  contrasted  with  the  same  sound  modified  by  removal  of  selected 
features.  In  addition,  there  were  sounds  generated  from  stationary 
noise  sources  to  have  controlled  characteristics  resembling  those  of 
the  marine  sources-  Two  such  customized  sounds  were  then  contrasted 
in  the  discrimination  tests.  The  two  laboratory-generated  sounds 
used  in  such  tests  differed  by  one  or  mere  dichotomous  features. 

The  sound  patterns  u^,  and  associated  with  the 

unmodified  marine  recordings  are  characterized  in  Table  1.  These 
sounds  were  from  magnetic  tape  recordings  and  were  processed  by  a 
whitening  filter  to  reduce  the  overall  dynamic  range.  The  whitening 
filter  removes  the  approximately  6 dB  per  octave  decrease  in  sound 
pressure  level  with  increasing  frequency  as  is  exhibited  by  many 
marine  sounds  (Kinsler  and  Frey,  1962).  See  Appendix  B for  a 
detailed  description  of  equipment  used  to  process  the  raw  data  and 
for  generation  of  the  final  audio  tapes. 

The  dichotomous  features  tested  were  the  result  of  modifications 
of  the  original  marine  recordings.  These  modifications  consisted  of 
filtering  or  removing  of  amplitude  modulation  by  using  a high  speed 
automatic  gain  control.  The  resulting  signal  patterns  are  listed 
below: 

a.  High  pass  filtering  with  a sharp  filter  with  3 dB  down 

point  at  707  Hz  to  conform  to  the  edge  of  octave  30  results 
in  signal  patterns  identified  as  tjJ2H30’  ancl  ^31130" 
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TABLE  1 

SUMMARY  OF  SOUNDS  ASSOCIATED  WITH  SOUND 

patterns  o^,  w 2 , and  ca3>  marine  sounds 


Sound 

Pattern 

Description  of  Its  Sound 

*1 

High  speed  rhythmic  sound  with  pronounced  bursts 
of  high  frequencies  but  of  short  duration. 

W2 

Very  regular  rhythm  like  that  ct  a steam  engine 
or  slow  train.  Some  irregular  high  frequency 
popping. 

W3 

Noise-like  without  any  noticeable  rhythm. 
Sounds  like  that  of  high  speed  air  but  at  the 

same  time  having  a low  rumbling  component. 
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b.  High  pass  filtering  with  a sharp  filter  with  cutoff  frequency 
of  354  Hz  to  conform  to  the  edge  of  octave  band  27  results  in 

signal  patterns  ^2H27  and  UJ3H27' 

c.  Elimination  of  temporal  amplitude  fluctuations  slower  than 

about  70  Hz  results  in  signal  patterns  ^ISN  uj  2SN ’ Here 

the  suffix  SN  is  used  to  indicate  that  the  resulting  pattern 
is  approximately  a stationary  noise. 

Table  2 summarizes  the  signal  pattern  pairs  tested  with  either 
student  or  sonar  operator  subjects.  The  number  of  data  points  with 
each  group  shown  in  Table  2 is  the  number  of  useful  events  after 
training  events  and  those  for  which  results  are  suspect  due  to 
extraneous  noise  interference  or  response  recorder  malfunctions  have 
been  eliminated.  This  tctal  does,  however,  include  data  collected 
using  either  modified  threshold  or  slide  wire  techniques  The  tabular 
headings  auH^  and  to:H^  are  used  to  indicate  the  signal  pattern 
with  the  dichotomous  features  absent  and  present,  respectively. 

The  reasons  for  choosing  these  modified  characteristics  cf 
noise-like  sounds  are  treated  in  greater  detail  in  Chapters  VI  and 
VII.  The  choice  of  filter  cutoff  frequencies,  however,  was  made  sc 
as  'o  conform  to  recommended  octave  bands  as  detailed  in  USA  Standard 
SI. 6-1967  to  insure  repeatability. 

Customized  signals  were  made  by  filtering  stationary  broadband 
noise  and  by  modulating  the  noise  in  some  cases.  These  signals  were 
designed  to  exhibit  many  of  the  characteristics  of  the  marine  sounds 
but  with  precisely  defined  properties.  Table  3 describes  signal 
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patterns  to  tc  to  . The  experiments  performed  using  the  customized 
signals  are  summarized  in  Table  4.  None  of  these  experiments  was 
done  using  student  subjects. 

The  background  or  interfering  noise  against  which  the  unknown 
or  probe  signal  was  presented  was  an  ocean  ambient  for  signal  patterns 
co^  , u^,  and  (Perrone,  1970).  This  ocean  ambient  was  the  result 

of  1/3-cct.ave  filters  summed  with  appropriate  weighting  and 
exhibited  a nearly  flat  spectrum  with  somewhat  accentuated  low 
frequency  components.  The  background  noise  used  with  signal  patterns 
co^  to  0)g  was  a white  noise  with  the  very  low  frequencies  filtered 
cut  to  avoid  audio  tape  saturation.  Figure  7 shows  the  1/ 3-octave 
spectra  of  the  background  noises  used  Note  that  a spectrum  which 
is  white,  ie.,  constant  energy  per  Hz  across  frequency,  will  have  a 
1/3-ortave  spectrum  which  increases  at  the  rate  of  3 dB  per  octave 
because  of  rhe  bandwidth  proportionality  exhibited  by  such  spectrum 
analyzers  (Peterson  and  Gross,  1963). 

5 3 Analysis  of  the  Data 

Ihe  measured  quant  .ties  .esultmg  from  the  modified  threshold 
expe-iments  were,  after  applying  corrections  and  calibration  factors: 
a The  signal-rc-nc lse  ratio  (SNR)  needed  for  the  subjects 

to  commit  tc  a terminal  decision  of  "reasonable  certainty." 
b The  classif ication  to (R ) corresponding  to  the  exposure 
signals  A or  B. 

c.  Time  frcm  onset  of  the  response  period  to  the  time  when 


the  terminal  decision  was  made 


TABLE  4 


01  these  experimental  measurables,  the  SNR  requires  additional 
explanation.  The  SNR  has  been  variously  described  as  the  relative 
energy  over  a band  cl  frequencies,  as  the  ratio  cl  background  to 
signal  spectral  level  at  one  frequency,  or  as  the  ratio  of  integrated 
signal  and  background  noise  powers  (Gundy,  1961;  Stallard  and 
Leslie,  1974).  In  this  thesis,  the  SNR  serves  as  a reference  only 
and  is  defined  as  the  difference  in  dB  of  the  1/ 3-octave  spectrum 
level  of  the  signal  (desirable  noise-like  sound)  and  the  background 
noise  against  which  the  probe  is  presented.  Furthermore,  this 
difference  in  dB  is  measured  in  the  1/3-octave  where  the  signal  has 
the  highest  spectral  level  relative  the  background.  Without 
exception,  this  reference  1/3-octave  band  falls  at  higher  frequencies, 
above  2 kHz,  for  the  signals  used. 

Because  cf  the  step-wise  increase  of  SNR  with  time,  the  SNR 
of  the  probe  relative  background  remains  constant  for  about  two 
seconds  between  changes  However,  the  subjects  were  unable  to 
determine  it  rhe  SNR  was  increased  linearly  or  in  steps  There  were 
no  transients  which  could  have  indicated  when  the  steps  occurred. 

The  importance  of  the  time  of  response  is  related  to  possible 
short  term  memory  effects  or  due  to  the  effect  of  shadowing  (Klat2ky, 
1975;  Parkinson,  Parks  and  Kroll,  1971).  Also,  the  listener  may 
find  it  necessary  to  shift  the  criterion  for  a decision  as  time 
progresses  In  later,  chapters,  the  importance  cf  criterion  will  be 
addressed  in  relation  to  response  uncertainty  and  confidence  ratings. 
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To  analyze  the  experimental  data,  the  effect  of  applying 
different  treatments  to  various  populations  of  subjects  was 
investigated.  The  treatments  consist  of  particular  exposure  set 
and  probe  combinations.  Hence,  a situation  where  the  signal  pattern 
is  associated  with  exposure  signal  A,  signal  pattern  U)^  with 
exposure  signal  B,  and  signal  pattern  u>^  is  used  as  the  probe 
signal  in  an  ocean  ambient  noise,  is  a specific  treatment.  The 
results  of  several  treatments  may,  however,  be  combined  (pooled)  in 
some  of  the  analyses.  The  populations  consist  of  groups  of  listeners 
for  whom  the  discrimination  performance  is  expected  to  be  similar 
by  virtue  of  training  or  background.  Analysis  of  the  data  shows 
that  there  are  essentially  three  populations  involved  in  these  tests: 

a.  Student  listeners  exhibited  comparable  performance  and 
all  seven  were  found  to  apply  criterion  in  a similar 
fashion. 

b.  Submarine  sonar  operators  with  less  than  two  years 
experience  tended  to  show  similar  performance. 

c.  Submarine  sonar  operators  with  more  than  two  years 
experience  comprised  a distinct  population.  They  seemed 
to  apply  criteria  differently  from  members  of  the  other 
two  populations. 

Other  than  partitioning  the  results  according  to  these 
populations,  no  distinction  is  made  between  subjects  in  a group. 

This  method  of  analysis  is  tantamount  to  assuming  that  there  are 
no  individual  differences  for  subjects  within  a population  (Murdock, 
1968) . 
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The  numerical  analysis  of  the  data  consisted  of  computing 
pertinent  first  order  statistics  for  treatments  applied  to 
populations  and  of  statistical  hypothesis  tests  used  tc  determine 
differences  due  to  treatments  or  populations.  The  principal  tests 
were : 

a.  Calculation  cf  the  mean  SNR  to  respond  and  the 
determination  of  confidence  limits  on  the  basis  cf  the 
sample  variance. 

b.  Determination  of  the  probability  of  a correct  response 
F(C)  and  the  confidence  interval  of  the  estimate 
assuming  that  there  was  an  equal  probability  cf  correct 
response  for  each  comparable  event 

c Computation  of  the  T statistic  and  application  of  the 
t test  to  determine  if  there  is  a significant  drfierence 
be'ween  mean  SNR  (SNR)  as  the  result  cf  treatments  or 
pcpulat icns. 

To  apply  the  t.  test  for  difference  cf  means,  fifteen  to 
twenty  e- en.ts  are  sufficient  for  most  tests  (Walpole  and  Myers,  19  72;. 
This  is  due  to  the  fact  that  the  t distribution  quickly  stabilizes 
as  the  number  of  degrees  of  freedom  is  increased. 

Howeve  ■ , a large  number  cf  events  is  needed  to  accurately 
estimate  F(C).  Assuming  that  events  comprise  Bernoulli  t ials 
with  equal  probability  cf  cccur rence,  the  binomial  distribution 

t(x)  = (x)  px  (1  - p)n  x , x = 0,1,2  .n  (5  1) 


0 elsewhere 
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applies  for  x observed  correct  classifications  in  n trials  (Hogg 
and  Craig,  1970).  Murdock  (1968)  shows  that  this  binomial 
distribution  may  be  transformed  to  an  approximately  Gaussian 
distribution  using  the  transformation 

0=2  arc  sin  yx/n  . (5.2) 

The  variance  of  the  transformed  variable  will  be: 

a'i  - 1/n  . (5  3) 

o 

Hence,  6 is  n(0Q,  1/n)  where  0^  is  the  transformed  sample 
probability  of  correct  classification. 

The  90  percent  confidence  interval  on  the  probability  of  correct 
classification  is  given  approximately  by: 

[ 0 0 - Z (0. 95) / /K  < h0  < 0Q  + Z(0. 95) / /n  ] , (5.4) 

where  Z(0.95)  is  the  Z score  corresponding  tc  the  one  tailed 
normal  distribution  (Murdock,  1968).  Also, 

Z (0. 95)  = 1.69  . 

Figure  8 shows  the  behavior  of  this  confidence  interval  for  the 
two  cases  where  the  actual  probabilities  of  the  occurrence  of  a 
correct  decision  are  0.7  and  0-5,  respectively.  It  is  seen  that  the 
number  cf  events  must  exceed  about  seventy  for  the  90  percent 
confidence  interval  to  be  within  + 10  percent  of  the  correct  value. 

The  numbet  of  events  needed  to  estimate  this  parameter  is  therefore 
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greater  than  50  and  preferably  70  if  the  confidence  interval  is  to  be 
at  all  useful.  Workers  using  the  forced  response  techniques 
routinely  use  300  cr  more  events  to  estimate  each  data  point  (Swets, 
1964;  Green  and  Swets,  1966;  Robinson  and  Watson,  1972). 

Hence,  whenever  the  performance  index  P(C)  is  to  be 
accurately  estimated,  many  events  must  be  tested.  However,  tests 
for  difference  in  SNR  due  to  treatments  or  populations  can  be  done 
with  a smaller  number  of  events.  This  fact  was  taken  into  account 
in  the  design  of  the  individual  experiments. 

5.4  Subject  Training 

The  training  of  subjects  for  participation  in  the  experiments 
was  organized  to  standardize  the  subject's  criterion.  However, 
some  improvement  in  performance  would  be  anticipated  as  the  signal 
patterns  become  more  familiar. 

The  student  subjects  were  payed  a fixed  amount  for  each  session 
they  participated  in.  Prior  to  starting,  they  were  given  general 
instructions  about  the  objective  of  the  experiment  and  the  ground 
rules.  Appendix  A documents  the  basic  instructions  given  as  well  as 
the  tape  recorded  instructions.  These  latter  instructions  are 
recorded  at  the  beginning  of  each  audio  tape  used  for  these  studies 
and,  although  they  differ  somewhat  for  sonar  operators,  stress  the 
salient  features  of  experiments  for  all  subjects. 

The  students  participated  in  pilot  studies  prior  to  beginning 
sessions  using  the  modified  threshold  procedure  and  also  were  exposed 
to  a minimum  of  five  sessions  for  which  the  response  data  was  not 
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processed.  Feedback  was  given  only  when  a subject  was  obviously 
applying  criteria  differently  than  the  norm,  or  if  there  was  some 
question  about  proper  understanding  of  the  procedures.  When  the  SNR 
to  respond  early  in  the  subject's  involvement  was  compared  to  the 
SNR  to  respond  to  the  same  treatments  after  extensive  experience, 
no  significant  change  was  found. 

An  independent  test  using  a naive  subject  demonstrated  that 
there  was  no  further  improvement  in  performance  after  ten  sessions, 
fewer  than  conducted  by  the  student  subjects  prior  to  beginning  data 
collection.  In  fact,  the  naive  subject  showed  a dramatic  change 
in  performance  over  cnly  the  first  five  sessions. 

T aimr.g  :i  the  sonar  operators  consisted  of  familiarizing 
these  lis'eneis  with  the  experimental  procedure.  It  was  assumed  that 
the  sonar  training  would  have  established  their  ability  to  distinguish 
sounds  at  a high  level.  These  operators  were  given  an  initial 
interview  ar  which  the  general  objectives  of  the  experiment  were 
outlined.  Then,  pric:  to  beginning  tests  with  the  modified  threshold 
(A/Bl  response  recorder,  they  participated  in  a session  using  the 
slide  wire  technique.  This  exposure  gave  them  an  opportunity  to 
listen  to  the  sounds  over  a large  range  of  SNRs  and  to  ask  questions. 

The  first  session  with  the  A/B  response  recorder  was  a short 
one  designed  to  allow  tot  questions  by  the  sonar  operators.  This 
data  was  net  used  Application  of  the  t test  for  change  of  SNR 
between  sessions  two  and  five  exhibited  no  significant  change  as 
their  participation  in  the  experiment  continued. 
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In  order  to  investigate  more  treatments  in  the  available  time, 
groups  cf  six  events  used  the  same  exposure  signals  but  did  include 


randomization  of  probe  signals.  This  seemingly  miner  revision  of 
experimental  method  between  students  and  sonar  operators  was  found 
to  introduce  significant  questions  of  interprecation  for  the  data 

! 


Chapter  IX  presents  the  observed  effects  of  intra-group  shifts  of 
performance. 
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DISCRIMINATION  WITH  DICHOTOMOUS  BANDS  OF  NOISE 
6. 1 General 

In  studies  of  the  intelligibility  of  speech  in  noise,  it  is 
found  that  a correspondence  can  be  established  between  the  cutoff 
frequency  and  listener  performance  (Pollack,  1948).  Furthermore, 
there  seems  tc  be  a definite  relationship  between  performance  in 
various  audicmertic  tests  and  the  trained  speech  recognition  process 
(Stevens  and  House,  1972).  For  this  reason,  one  of  the  principal 
areas  ci  investigation  centered  around  the  effect  of  high  pass 
filtering  on  the  classification  performance  against  marine  sounds. 

The  study  'onsist.ed  of  four  experiments  involving  recordings  cf 
marine  sounds.  It.  addi'ion,  there  were  three  experiments  using 
customized  signal  patterns.  These  latter  experiments  were  designed  tc 
test  performance  for  signal  patterns  which  had  dichotomous  features 
approximating  those  in  the  marine  source  disc  r lmmat  icn  task. 

This  chapter  expands  on  the  model  of  the  teature  extraction  and 
hypothesis  testing  presented  in  Chapter  II.  The  detectabil i r y ot  'he 
dichotomous  feature  is  derived  and  results  are  compared  to  experiments 
cndirted  by  others  using  signal  detectability. 
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6 2 A Model  lor  Noise  Bands  Dichctcmous 

When  two  pattern  classes  differ  in  that  cne  (u;  ) has  the 

low  trequencies  eliminated  by  filtering,  the  dichotomous  feature  will 
be  defined  to  be  , a feature  which  embodies  all  of  the  low 

frequency  information.  For  realistic  signals,  this  feature  is 
associated  with  a number  of  source  auditory  parameters.  Typically, 
the  parameters  include: 

a.  A band  limited  noise  with  ncn-umform  spectral  density 
ever  a range  of  frequencies.  At  the  low  frequency  end, 
the  band  is  determined  by  the  diminution  ot  auditory- 
acuity  at  near  intra-scnic  frequencies  or  by  the 
reproduction  medium.  At  its  upper  edge,  the  band  is 
limited  by  the  cutoff  frequency  of  the  high  pass  filter. 

b.  Temporal  variations  ot  amplitude  associated  with  nen- 
srati  narity  ot  the  noise  m the  band. 

c Mere  complex  temporal  changes  in  the  band  limited  signal 
such  as  time-dependent  spectral  content  cr  tones 

For  the  purposes  cf  modeling  the  discrimination  process, 

however,  assume  that  the  dichctcmous  feature  ^ can  be 

characterized  strrctly  as  a stationary  noise  with  rectangular 

bandwidth  W ,,  and  ( or r espond ing  spectral  level  S . The 
etf  ett 

discrimination  task  then  reduces  to  a hypothesis  test  tor  the 
detection  cf  the  dichctcmous  feature.  The  two  classification 
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a.  Decide  oo  whenever  the  null  hypothesis  is  not  rejected. 

Hence,  corresponds  to  feature  ft  ^ absent, 

b.  Decide  w whenever  the  null  hypothesis  is  rejected.  Or, 

corresponds  to  feature  ft  ^ present. 

The  detectability  of  a band  of  noise  was  discussed  m Chapter  II.  The 
optimum  performance  was  given  by  Equation  (2.6). 

The  ability  of  listeners  tc  detect  bands  of  noise  in  a noise 
background  was  experimentally  investigated  by  Green  (1960a).  Green 
found  that  the  observed  detectability  in  this  situation  is  given  by: 


d\ 

obs 


= a 


d' 

opt 


0.25  < a < 0.33  , 


B l 
[• 


where  a depended  on  the  subject.  This  finding  was  seen  to  hold 

ever  a large  range  of  bandwidth  and  duration  of  the  stimulus.  In 

these  experiments,  the  SNR  was  varied  until  the  performance,  as 

measured  by  P(C),  artained  a desired  level.  In  computing  d'  , he 

obs 

used  SNRs  which  gave  a P(C)  of  about.  0.75 

Figure  9 is  a compilation  of  all  of  Green's  findings  for  the 
detect icn  cf  a noise  band  in  noise  The  data  shewn  includes  data  taken 
over  a range  of  SNRs  so  as  to  map  cut  the  psychometric  function;  the 
behavior  of  P(C)  with  stimulus  intensity  relative  background  noise. 
The  data  was  collected  using  five  subjects  and  with  the  following  set 
of  parameters: 

t = 400,  800,  1500,  2500,  4500,  6000  Hz 

c 

to  = 655,  3862,  5143  Hz 
T = 3,  10,  30,  100,  300  msec. 


i 


10  log  d'  opt 


Figure  9.  Summary  of  Green’s  Results  for  the  Detection  of  a 
Noise  in  Noise. 
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The  parameter  f is  the  band  center  frequency. 

These  results  are  for  a two  alternative,  forced  choice  (2AFC) 
test  and  the  data  is  normalized  using  the  equation 


d' 

opt 


(WT) 


1/2  "S2 


[l/2(a2/o2)2  + (ci  + a2)  + 1 

bn  bn 


-1/2 


(6.1) 


Extending  these  results  to  the  model  cf  discrimination  as  one  of 

feature  extraction  and  hypothesis  tests  on  the  energy  in  the  feature, 

the  model  predicts  the  following: 

a.  The  discrimination  performance  is  predicted  to  be 

independent  of  which  band  is  dichotomous  but  depends  on  the 

effective  bandwidth  W ... 

ef  f 

b Given  a knowledge  of  the  effective  time  duration  of  the 
stimulus  T,  the  performance  should  be  relat&ble  to  that 
observed  by  Green 

c.  The  SNR  required  to  distinguish  the  signal  patterns  is 

further  predicted  to  be  unrelated  to  other  parameters  of 

the  sound  which  are  common  to  both  sound  patterns 

An  immediate  difficulty  in  this  approach  is  the  definition  of 

the  effective  time  I,  and  although  the  SNR  was  maintained  constant 

for  approximately  two  seconds  between  step  changes,  the  listener  could 

have  integrated  over  successive  steps.  Alternately,  a time  constant 

considerably  shorter  than  two  seconds  may  apply.  While  the 

expression  for  d'  implies  continuously  increasing  detectabilltv 
opt 

with  time  T,  it  has  been  found  that  a listener  extracts  all  useful 
information  from  an  acoustic  stimulus  in  the  tirst  few  hundred 
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milliseconds  (Tanner  and  Sorkin,  1972).  Green  also  observed  chat 

d',  for  T = 1000  msec  = d'  for  T = 300  msec, 
obs  obs 

A detailed  analysis  of  the  response  behavior  for  the  modified 
threshold  procedure  indicates  that  listeners  exhibit  comparable 
behavior  in  rhis  type  of  experiment.  The  number  of  observed  responses 
as  a function  cf  delay  from  the  step  change  in  SNR  is  plotted  in 
Figure  10.  This  data  is  for  160  responses  chosen  at  random  from 
among  student  events,  and  for  180  responses  frcm  Navy  personnel 
events.  In  Figure  10,  the  independent  variable  x is  the  delay  to 
respond  from  the  onset  of  the  step  change  in  SNR.  The  peak  in  the 
number  of  responses  near  600  msec  is  significant  at  the  0.1  level  for 
student  and  sonar  operator  events  under  the  assumption  or  a Poisson 
distribution  for  the  number  of  responses  per  100  msec  of  delay.  This 
observation  tends  to  support  the  view  that  each  step  change  in  SNR 
can  be  treated  as  a detection  opportunity.  After  a delay,  the  listener 
has  extracted  all  of  the  additional  information  provided  by  the  change 
in  detectability  associated  with  tire  change  in  SNR  and  either  makes  a 
terminal  decision  or  defers  the  decision  until  another  such  change 
occurs.  It  is  noteworthy  that  this  result  is  observed  in  spite  of  the 
fact  that  listeners  were  unable  reliably  to  describe  the  way  ttr  which 
the  SNR  changed  when  asked  to  verbalize  the  auditory  sens.it  ton . Tltev 
were  unable  consciously  to  tell  if  the  SNR  changed  continuously 
in  steps! 
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On  the  basis  of  these  data,  and  in  keeping  with  models  of  the 
auditory  process,  it  was  inferred  that  an  integration  time  or  300  to 
500  msec  is  reasonable  (Dubrovskn  and  Tumarkina,  1967;  Bauer  and 
Torick,  1966),  An  additional  100  msec  delay  due  to  listener  response 
time  would  account  for  the  observed  peak  in  subject  response  delays. 
The  data  will  subsequently  be  reduced  assuming  that  the  effective 
time  is  500  msec. 

When  the  equivalent  square  bandwidth  W ^ is  determined  for 
the  three  marine  sounds  highpass  filtered  at  707  Hz,  it  is  found  that 
most  of  the  energy  in  the  dichotomous  feature  Q ^ falls  in  an 
octave  band  centered  nominally  about  500  Hz.  Hence, 


W . , = 1 72  Hz  for  Q , • 
eft  -L2/ 


At  this  pcint. , all  of  the  factors  needed  to  compute  d^  tor 
the  model  assumed  are  available  provided  that  the  SNR  to  respond  is 
known.  Hcwever,  Stallard  and  Leslie  (1974)  conclude  on  theoretical 
grounds  that  the  difference  between  a 2AFC  experiment,  as  is  Green's 
experiment,  and  detect  ion  performance  in  a sonar  cr  other  real-world 
situation  should  be  about  5.4  dB,  Their  reasoning  is  as  follows: 
a.  The  effect  of  frequency  uncertainty  because  two  bands  of 
noise  must  be  attended  to  introduces  some  decrease  in 
detectability  for  the  band  of  noise  to  be  detected.  The 
ether  band  of  noise  which  must  be  attended  to  is  one  used 


i 


1 


as  a reference  to  judge  the  background  level ■ 
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b.  Time  uncertainty  about  the  onset  time  of  the  signal  also  has 
the  same  effect.  This  uncertainty  is  due  to  the  tact  that 
such  real-world  tasks  do  not  have  the  time  periods  during 
which  the  signal  is  present  precisely  delineated  as  is  the 
case  in  2AFC  tests. 

c The  fact  that  the  task  in  the  modified  threshold  situation 
is  one  of  deciding  "yes"  the  feature  is  present,  or  "no" 
the  feature  is  not  present  adds  an  additional  factor  of 
■fl  to  the  detectability  equation.  This  difference  in 
efficiency  of  YN  and  2AFC  tests  is  equivalent  to  1.5  dB 
in  the  log  detectability  equation. 

For  these  reasons,  the  detectability  observed  in  the  modified 

threshold  tests  (d1  ) is  expected  to  be: 
mt 


d 1 = 3.5  d' 

mt  obs 


(6.2) 


theoretically,  where  d',  is  used  to  identity  the  results  observed 

obs 

by  Green.  Or, 


10  log  d'  = 10  log  d'  + 5.4  dB  . (6.3) 

mt  obs 

This  theoretical  performance  is  predicted  if,  for  each  value 
of  SNR,  the  listener  uses  only  the  information  available  in  the 
auditcry  stimulus  at  that  time.  However,  if  it  wete  possible  tor  the 
listener  to  integrate  over  successive  observaticns  as  the  SNR  increases, 
this  would  be  useful  information  and  the  uncertainty  would  be  reduced, 
or  the  detectability  would  be  higher  (Reisbeck,  1963).  In  fact,  Swets 
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and  Green  (Swets,  1964)  have  been  able  tc  demcnstrate  that,  In  very 
specialized  circumstances,  listeners  can  integrate  information  in 
successive  observations.  In  general,  however,  they  note  that:  "This 

analysis  leaves  little  doubt  that  the  assumption  of  nc  integration 
over  successive  observations  is  a good  one  . , , " (.Swets,  1964, 
p.  239). 

Note,  however,  that  the  thresholds  y and  y applicable  to 

A D 

sequential  hypothesis  testing  may  change  with  time  or  may  be  related 
to  other  factors  involved  in  the  test.  These  thresholds  affect  the 
probabilities  of  the  various  courses  of  action  but  net  the  term  ot 
the  applicable  detectability  equation  for  d^  . 

In  summary,  applying  the  model  of  a discrimination  as  a 
hypothesis  test  about  the  presence  of  a dichotomous  feature,  the 
cbservables  in  an  experiment  using  the  modified  threshold  ate  SNR, 
the  mean  s ignal-to-ncise  ratio  to  make  a terminal  decision,  and  PiC), 
the  probability  of  a correct  classification-  This  probability  will 
be  related  to  the  criterion  used  to  make  the  terminal  decision,  m 
this  case,  "reasonably  certain  " 

The  probability  P(C)  is  given  by 

P(C)  - F(ffi  |H1)  P (H1 ) + P(fi  H>  |HQ)  P(HQ)  v6  4) 


since  a correct 
1 eature  Q 
The  conditions 


decision  consists  of  deciding  correctly  that  the 
is  present  (H^ ) or  that  the  feature  is  absenr  (H^) . 
Hq  and  are  of  course  mutually  exclusive  and 


exhaust  n e 
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The  above  probability  P (co  |H^)  corresponds  to  the  "hit  rate" 
in  evenr -response  formulations  leading  to  the  definition  of  receiver 
operaring  characteristics  (Egan,  1975).  The  rate  of  incorrect 
acceptance  or  "false  alarm"  rate  is  then 


P(FA)  = P(u>_  |Hq) 


Furthermore,  referring  to  Figure  2b,  the  error  regions  under 
the  two  distributions  are: 


a.  P (FA)  = P(ui  |HQ)  = at  = / p ,H  (X|hq)  dx 

YA  ° 

b-  p<“.lJV  = 6t  = /YB  px|h2  (xiV  dx  • 


(6.5) 


(6.6) 


6 • 3 Experimental  Results  for  High  Pass  Filtered  Marine  Sounds 

The  lour  experiments  using  high  pass  filtering  of  marine  sounds 
are  summarized  in  Table  2.  The  treatments  investigated  are  fvtther 
detailed  in  Table  5 where  the  number  of  events  per  treatment  is  also 
shown.  Events  for  which  the  subjects  did  not  respond  have  been 
omitted  from  Table  5. 

From  Table  5,  it  is  observed  that  some  treatments  differ  only 
in  the  order  in  which  'he  signal  patterns  are  presented  during  the 
exposute  sequence  For  example,  Treatments  1.1  and  1.2  both  have 
the  unfiltered  marine  scund  (d^ ) as  the  probe  signal  but  differ  in 
the  fact  that,  fct  Treatment  1.1,  this  signal  pattern  is  presented 
first  and  Is  identified  as  signal  A to  the  subjects.  Also,  some  of 


TABLE 
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the  treatments  have  only  a few  events  because  results  had  to  be 
discarded  due  to  equipment  malfunction  or  noise  at  the  test  site. 

Some  treatments  which  were  planned  to  be  tested  m greater  detail 
with  sonar  operators  were  not  done  because  of  operators  net  coming 
on  time  or  failing  to  show  up  altogether.  Treatments  116,  III. 8, 

III. 12,  and  IV. 2 probably  have  too  few  data  points  to  allow  even 
testing  differences  of  means  on  SNR. 

The  first  order  statistics  for  Experiment  1 are  tabulated  in 

Table  6.  This  table  shows  the  mean  signal-to-noise  ratio  (SNR)  to 

respond,  and  the  sample  standard  deviation  S , Likewise,  these 

dNK 

same  data  are  contained  in  Table  7 for  Experiment  II  and  in  Table 
8 for  Experiment  III.  Only  this  latter  experiment  has  enough  data  to 
allow  comparisons  between  the  two  groups  of  sonar  operators,  those 
with  less  than  two  years  experience  and  these  with  two  years  experience 
or  more. 

It  is  seen  in  all  student  cases  that  treatments  which  differed 
only  in  the  order  of  r he  exposure  signals,  and  which  had  the 
dicbcrcmcus  feature  present  during  the  response  period,  yielded 
comparable  values  of  SNR  and  could  tbereiore  be  pooled  m the  data 
analysis  and  treated  as  equivalent  However,  in  every  student 
experiment  ot  this  kind,  there  was  a statistical ly  significant 
difference  in  results  for  the  treatments  where  the  probe  was  missing 
the  dichotomous  feature  (ui  ar*d  orc*et  signal  exposure 

was  dtferent  This  difference  yielded  significant  t scores  which 
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TABLE  6 

FIRST  ORDER  STATISTICS  FOR  EXPERIMENT  I, 

A DISCRIMINATION  TEST  WITH  BAND  OF  NOISE  DICHOTOMOUS 


SNR 

SSNR 

N 

I 1 

1.37 

2 72 

11 

1 2 

1.80 

2.  65 

21 

1.3 

0.71 

2.51 

24 

1.4 

-2  72 

2.74 

9 

1.5 

5 10 

2.64 

8 

1.6 

4 99 

3.03 

9 

TABLE  7 

FIRST  ORDER  STATISTICS  FOR  EXPERIMENT  II, 
signai  patterns  uj2h30 


1 1 

i 

1 r eatment 

SNR 

SSNR 

N 

11. 1 

-0-42 

2.86 

• 

! 

1 

11,2 

0.77 

3.83 

35 

_ .. 

II  3 

1.71 

2.02 

9 

1 

II  4 

! - - 

-2  00 

3 37 

9 

1 

1 

II  5 

7 19 

3 53 

8 

i 

* 

11.6 

5.20 

2 64 

. 
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Of  doubtful  significance 

. ( 


a.  T = 3.41  with  31  degrees  of  freedom  (v)  in  Experiment  1 


b.  T = 2 . 83,  V - 16  in  Experiment  II 

c.  T = 195,  V = 57  in  Experiment  III 

where 

(SNR  - SNR  ) 

T = ■ . (6.7) 

SSNR  ^1/N1>  + (1/N2> 

In  comparisons  such  as  these,  the  T score  is  significant  at 
the  0.1  level  whenever 

T > t (0.95,  v) 

and  t (0.95,  v)  is  the  single  tail  t statistic  at  the  95  percent 
level  with  V degrees  of  freedom  (Weinberg  and  Schumaker,  1962). 

This  effect  is  difficult  to  explain  since  it  goes  counter  to 
expectations  based  on  "recency"  of  the  training  stimulus  (Klat2ky, 

1 9 7 5 ) Both  groups  of  sonar  operators  also  show  a similar  dependence 

on  order  of  exposures.  Pooling  of  data  without  regard  to  exposure 
order  can  therefore  lead  to  a type  I error  when  testing  it  there  is 
a significant  difference  in  SNR  to  respond  when  the  signal  patterns 
to  and  to  u serve  as  the  probe.  Type  1 errors  are  committed 
whenever  the  null  hypothesis  is  rejected  when  it  is  true. 

Referring  again  to  Table  5,  it  is  seen  that  Treatments  1.1  and 
1.2,  1.3,  and  1.4,  etc.,  ate  similar  in  exposure  order  and  only  differ 
in  the  probe  used.  Using  the  case  where  w is  exposed  first  in  the 
treatment,  the  point  estimate  of  SNR  is  obtained  under  hcpefullv 
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similar  experimental  conditions.  The  results  are  plotted  m Figures 
11,  12,  13,  and  14  for  both  student  subjects  and  sonar  operators 
In  some  cases,  however,  the  case  where  signal  pattern  a)  was 

exposed  first  is  used  instead.  This  is  the  case  when  comparison  to 
limited  sonar  cpetatcr  data  was  desired  as  is  the  case  for  Experiment 
I.  These  figures  show  the  signal  excess  over  ncise  in  dB  in  third- 
octaves  for  the  total  frequency  range.  Note  that  a change  in  SNR  m 
the  modified  threshold  experiment  leads  to  a simple  translation  of  the 
signal  excess  curve  up  or  down.  The  figures  graphically  demonstrate 
how  far  "out  of  the  noise"  the  signal  must  be  for  the  listeners  to 
obtain  enough  data  to  commit  to  a terminal  decision. 

In  addition,  it  is  seen  that  in  most  cases,  the  SNR  to 
discriminate  when  the  dichotomous  feature  was  present  is  comparable 
to  that  needed  to  discriminate  when  the  dichotomous  feature  was 
absent.  In  the  cases  where  the  SNR  was  significantly  diilerent  for 
the  twc  probe  conditions,  the  observation  may  be  the  'esvir  :t  the 
major  effect  cf  exposure  order. 

For  these  treatments  where  there  was  a significant  ditference 
between  discrimination  for  the  two  probe  conditions,  tie  data 
indicate  that  the  oNR  tor  the  terminal  decision  was  Lcwe  r when  rhe 
dichotomous  tea-ure  was  absent.  These  treatments  were: 

a.  12  and  1.4,  T = 4 24  for  28  degrees  of  treedem  f.) 

b.  II. 2 and  II  4,  I = 1.98,  V = 42 

c.  III.l  and  111.3,  T = 2.09,  v * 59  . 


FREQUENCY  (kHz) 


Figure  11.  Signal  Excess  in  One-Third  Octave  Bands  at  the  Terminal 
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Figure  13.  Signal  Excess  at  the  Terminal  Decision  for  Signal  Patterns 


and  a) 


3H30’ 


80 


The  data  in  Table  8 also  show  that  the  population  of  listeners 
consisting  of  sonar  operators  with  little  at-sea  experience  exhibit 
a dramatically  higher  standard  deviation  in  SNR  to  respond  than  do 
either  University  students  or  highly  experienced  operators.  To  test 
if  this  standard  deviation  (cr  variance)  is  in  fact  signif icanr ly 
greater,  the  F statistic  is  computed  (Walpole  and  Myers,  1972) 
However,  unlike  the  T statistic,  which  is  very  distribution 
insensitive,  the  F statistic  is  valid  only  whenever  the  distribution 
of  observations  can  be  assumed  normal.  The  observed  SNR  tc  respond 
is  plotted  in  Figure  15  fcr  scnar  operators  with  Experiment  III  The 
figure  shows  that  an  assumption  of  normally  distributed  observations 
is  not  unwarranted.  The  observed  difference  in  the  va  ance  exhibited 
by  the  low  experience  sonar  operators  against  that  ct  the  University 
students  and  experienced  sonar  operators  was  m no  case  significant 
at  the  0- : level.  Hence,  the  difference  could  simply  be  due  tc  chance. 

The  other  .quantity  measurable  in  the  modified  threshold 
exper  ‘.me’- 1 c :s  the  pcbab^iity  of  various  decision  al ' etna' ives  The 
experimental  results  are  summarized  in  Table  9 Th^s  'able  gives  both 
'he  point  estimate  and  90  percent  confidence  interval  cr  the 
probabilities. 

The  two  alternatives  corresponding  to  correct  decisions  are: 

a F(i  | H ) , the  probability  of  correct  accep'ance,  or 
hit  rate. 

b.  P(ui  |H  ),  the  probability  of  correct  reiecticn  when 
the  dichotomous  feature  fl  is  absent. 
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In  addition,  the  resulting  probability  cf  a correct  response, 
correct  acceptance  cr  rejection,  is  alsc  shewn  The  table  clearly 
shews  hew  large  the  confidence  intervals  on  prebabilit res  is  unless 
a large  number  of  data  points  is  available  Only  'treatment  results 
with  sufficient  data  points  to  give  seme  credibility  tc  tr.e  point 
estimate  are  shown  Data  are  pooled  across  presentation  order. 


6 . 4 Relationship  Between  Results  for  High  Pass  filre:ing 
and  the  Model 

Tc  determine  how  well  the  experimental  results  relate  tc  the 

model,  the  data  are  compared  to  those  obtained  by  Green  (1960a)  as 

described  in  Section  62  In  that  section,  the  normalization  equation 

was  p-e^enred  and  the  effective  bandwidth  W ,,  was  shewn-  Ihe 
r eft 

effective  spectral  le' el  S ^ is  then  obtained  by  noting  -he  SNR 
to  tespo'd  experimentally  Once  this  value  is  known,  thf  spect-ai 
le’el  jf  the  d-'hetemous  bard  ct  ncis?  £1  : s obtained  direr  rly 

frem  'he  calibration  data 

The  pc  .nt  and  i"ervai  estimates  icr  the  stietrive  .ar.es  of 
s i gra 1-r c-r : i se  power  in  the  dichotomous  band 


S*  / N ‘ - c~-c‘ 

eft  On 


ate  shown  ;n  Table  i0.  The  table  also  shews  the  p.  rt  zr.d  interval 
estimates  cn  the  quantify 


1°  log  d*  , 
mr 


the  observed  detectability  for  the  feature  normalized  using  Equation 


(6.1).  The  subscript  mt  is  used  to  designate  data  obtained  using 
modified  threshold  techniques 

Note  that  the  dichotomous  feature  ^2L27  resultmg  from  high 
pass  filtering  at  354  Hz  exhibits  a detectability  which  is  ccmparable 
to  that  found  for  the  features  (2  resulting  from  high  pass 

filtering  signal  patterns  and  at  707  Hz 

When  all  of  the  data  for  high  pass  filtering  of  marine  sounds 
are  compared,  it  is  seen  that  the  performance  fcr  the  various  signal 
patterns  is  comparable.  The  results  are  plotted  in  Figure  16  alcng 
with  a fourth  order  polynomial  regression  fit  to  Green's  da'a  (Green, 
1960a;  McGill,  1968)  The  theoretically  predicted  correction  for  a 
sonar  cr  ether  industrial  environment  detection  problem  is  also  shewn 
(Stallard  and  Leslie,  1974) 

Figure  16  shows  that  University  student  listeners'  per tormance 
falls  very  nei'  thar  predicted  by  the  model  Also,  senar  cpecators 
seem  to  consistently  require  a higher  SNR  to  respond  but  they  are 
net  able  tc  improve  their  performance  in  terms  of  PiC)  proport i narel\ 
In  'he  cne  experiment  whe-e  data  of  inexpe* lenced  s experienced 
sonar  operators  were  sufti:ient,  the  experienced  cpe.atc:  d d 

an  apparent  improvement  in  F(C)  at  a higher  SNR.  Thi~  d» 

P(C)  is  not  statistically  significant,  hewever 

The  consistency  of  these  findings , m spite 
differences,  is  gratifying  especially  when  t 
simplifying  assumptions  are  etnbcdied  m the  - 
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Figure  16.  Comparison  of  Experimental  Results  with  the  Predicted 
Performance  for  the  Detection  of  Noise  in  Noise. 
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as  performing  a feature  extraction  followed  by  a hypothesis  test  based 

on  energy  in  the  dichotomous  feature.  Just  the  uncertainty  of  the 

exact  effective  bandwidth  W , , to  use,  and  that  of  the  uncertainty 

ef  f 

of  effective  time  T,  could  account  for  nearly  2 dB  in  the  resulting 

value  of  d ' 
mt 

So  far,  the  problem  has  been  reducible  to  one  of  detection  of  a 
feature.  Certainly  the  present  discrimination  task  is  a more  complex 
one  than  implied  in  the  model.  For  example,  whenever  all  energy  above 
the  cutoff  frequency  used  here  is  removed,  the  model  predicts  the  same 
results  since  it  does  not  make  use  of  the  information  in  this  band. 

The  performance  is  derated  by  a factor  to  allow  for  listening  cc  two 
bands  of  energy  but  nc  other  assumptions  about  how  this  energy  in  the 
fixed  portion  of  the  signal  is  used.  Hcwever,  were  the  higher 
frequency  features  removed  in  practice,  the  listener  behavior  would 
be  much  different.  The  present  results  show  that  the  SNR  tc  make  a 
terminal  decision  is  not  much  different  when  the  feature  fi  ^ is 
present  or  absent  With  nc  other  information,  hcwever,  rhe  listener 
would  never  be  able  to  matte  a terminal  decision  with  The  feature 
absent  since  nothing  would  serve  as  a reference  as  tc  how  rar  the 
signal  was  cut  of  the  noise. 

Clearly,  then,  the  listener  is  simultaneously  using  various 
types  of  information  in  the  signal  pattern  to  make  the  terminal 
decision  required  ip  the  modified  threshold  paradigm.  In  order  to 
obtain  better  insight  as  tc  the  role  of  the  various  features  of  the 
signal,  laboratory  generated  signals  were  used.  The  cbavar  ’•et  lstics 
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of  these  signals  are  summarized  in  Table  3.  Table  4 lists  the 
experiments  performed  with  these  sounds.  Of  these  experiments. 
Experiments  VIII,  IX,  and  X are  applicable  to  the  study  of 
discrimination  where  the  dichotomous  feature  is  a band  of  noise.  The 
dichotomous  feature  in  these  cases  is  !1  an  octave  band  of  noise 

centered  at  500  Hz.  The  fixed  feature  was  an  octave  band  of  noise 
centered  at  4 kHz  and  was  chosen  to  have  three  distinct  variations 
which  serve  as  the  factors  for  which  the  discrimination  performance 
is  measured.  This  feature  is  designated  fi,*.,,,,  or 

Table  11  lists  the  experimental  results  using  these  signals. 

The  exposure  order  effect  previously  mentioned  seems  to  be  present  in 
this  data  as  well.  Note,  however,  that  for  Experiment  VIII  there  is 
a significant  and  consistently  higher  SNR  to  discriminate  when  the 
dichotomous  feature  was  absent.  Using  the  results  for  Treatments 
V1II-1  and  V11I-2,  the  detectability  d^  is  obtained  for  the  two 
cases  differing  In  the  absence  cr  presence  of  the  feature  ^B27‘ 

These  results  were: 

a.  10  log  d'  under  H,  is  9.4  < 10.0  < 10-4  at  the  90  percent 
opt  1 

confidence  level 

b-  10  log  d'  under  H..  is  10.8  < 11.1  < 11.4 
6 opt  0 

To  investigate  this  effect  in  greater  detail,  the  way  in  which 
the  listener  recognizes  that  the  signal  is  sufficiently  far  above 
the  noise  to  determine  that  the  dichotomous  feature  is  absent  is  now 


addressed . 
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A possible  model  of  the  process  is  shown  In  Figure  17.  This  model 
is  an  application  of  Neisser's  theory  for  pattern  recognition  by 
humans  (Reed,  1973).  The  model  shows  that  the  listener  must  do 
feature  extraction  and  hypothesis  testing  on  both  the  dichotomous 
feature  and  the  invariant  feature  to  decide  that  the  feature  is 
absent,  i.e.,  The  feature  extraction  is  here  shown  as  being 

done  in  parallel  which  is  one  of  the  salient  features  of  Neisser's 


theory  This  aspect  of  the  model  is  not  critical,  however,  and  it  is 
intended  only  to  show  the  possible  decision  sequence.  This  simple 


model  demonstrates  that  the  decision  is  independent  of  other 

characteristics  of  the  signal.  Threshold  y corresponds  to  the  one 

D 

in  Chapter  II  for  sequential  decision  making.  The  threshold  y 

is  the  same  kind  cf  threshold  applied  to  a decision  about  the 

presence  of  the  high  frequency  band.  The  decision  constants  y„ 

L)  hi 

determine  the  threshold  y shown  in  Chapter  II. 

From  Figure  17,  it  is  seen  that  the  decision  tc  report  that 
the  feature  ft  is  absent  requires  also  that  feature  ft  be 

at  least  detected.  From  the  experimental  results,  the  detectability 
of  this  feature  was,  for  Experiment  ViII, 

a.  10  log  d'  „ for  ft7_0,  = 10-3  < 11.0  < 11. 6 for  ftD07 

opt  7B36  B27 

present, 

b.  10  log  d'  for  ft-,-.,  = 12.5  < 13.1  < 13.6  fcr  ft,..., 

° opt  7B36  B27 

absent 

where  the  90  percent  confidence  intervals  are  shown.  It  appears, 
therefore,  that  the  threshold  for  deciding  on  the  observables 
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H1 


Figure  17.  A Model  of  1 
Clues  Needec 
is  Absent. 
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from  the  high  frequency  band  is  about  3 dB  higher  thar  'hat  needed  to 
detect  chat  band  Here  it  is  assumed  that  the  high  frequency  band 
requires  a detectability  comparable  to  that  for  the  lew  frequency 
band  to  satisfy  the  first  decision  in  Figure  17. 

Expet iment  IX  is  identical  to  the  experiment  derailed  above 
except  that  the  spectral  level  of  the  high  frequency  band  was  increased 
6 dB  relative  to  that  in  the  low  frequency  band.  The  data  icr  this 
experiment  is  summarized  in  Table  11.  Since  the  SNR  measured  in  these 
experiments  is  for  a thud-octave  band  at  the  higher  irequencies 
relative  to  the  background  noise,  the  data  in  the  table  actually 
relate  to  how  tar  cut  of  the  noise  the  high  frequency  band  was  at  the 
terminal  decision:  Because  of  limited  data.  Treatments  iX  2 and  IX. 4 

were  used  in  the  analyses  tc  give  the  following  results; 

a 10  leg  d'  unde:  H,  is  9 1 -<9  4 <9.8  at  the  90  percent 

opt  1 

conridence  ie  el, 

b-  10  log  under  is  9-2  <9  7 <10.2 

The  exper  imect  al  data  show  that  in  this  case  the  per  reman  e 

under  H,  and  are  nearly  l<ienricai.  This  tends  to  lend  credence 

to  the  model  cl  the  p • or  ess  as  postulated  For  this  case,  the 

threshold  , figure  17)  would  be.  surpassed  by  the  ebser  cables  in 

the  high  frequency  bard.  The  listener  must  make  an  audit „_nal 
comparison  shewn  as  Decision  3 in  Figure  17  which  is  related  tc  how 
rar  the  high  irequency  band  is  cut  of  the  noise  and  which  ee°ms  to 
require  an  accurate  mental  image  of  the  sound.  The  listener  m 
effect  must  remember  how  the  low  and  high  bands  were  related  and 
match  this  to  the  background  noise  as  well 
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In  Experiment  X,  the  fixed  feature  was  an  amplitude  modulated 
band  of  noise  which  was  very  easily  recognized  in  the  background. 
According  to  the  model  discussed  in  this  section,  this  experiment 
should  be  analogous  to  Experiment  IX  in  that  the  feature  common  to 
both  signals  has  a high  detectability.  The  experimental  results  for 
the  detectability  needed  to  commit  to  a terminal  decision  were: 

a 10  leg  d'  . under  H-  is  11  1 < 11-4  < 11.6  at  the  90  percent 
opt  1 

confidence  level, 

b.  10  leg  d'  . under  H.  is  10.5  < 11.1  < 11.5 
° opt  0 

The  SNR  to  respond  was  comparable  for  the  cases  where  the 
dichotomous  feature  was  absent  and  when  it  was  present.  This  finding 
agrees  with  what  was  predicted  above.  However,  the  SNR  was 
significantly  higher  than  that  observed  in  Experiments  VI 11  and  IX 
for  the  situation.  One  is  tempted  to  attribute  this  effect  to 

the  presence  of  the  highly  detectable  feature  interacting  with  the 
ability  of  the  listener  to  concentrate  his  attention  cn  the  band  of 
noise  to  be  detected.  The  subjects,  in  fact,  complained  that  they 
were  having  a hard  time  attending  tc  the  task  because  the  amplitude 
modulated  component  was  so  "monotonous  " However , the  amount  cf  data 
available  as  the  result  of  these  studies  dees  not  allcw  for  any  such 
conclusion  This  is  especially  true  because  of  the  seeming  dependence 
of  results  on  exposure  order. 


.4 


CHAPTER  VII 

EXPERIMENTS  WITH  OTHER  DICHOTOMOUS  FEATURES 

7.1  Introduction 

Studies  investigating  speech  intelligibility  have  shown  that  the 
temporal  structure  of  signals  is  important  in  the  determination  of 
listener  performance  (Licklider,  1951;  Miller,  1948).  In  order  to 
investigate  this  aspect  of  noise-like  sounds,  two  sets  of  experiments 
were  performed  using  the  modified  threshold  procedure.  These 
experiments  compared  classification  performance  when  the  exposure  set 
consisted  of  two  signals  which  were  spectrally  similar  but  differed  in 
the  signal  envelope  structure.  The  results  of  these  experiments  are 
presented  in  this  chapter  along  with  an  analysis  of  the  results  from 
the  point  of  view  of  a feature  extraction  and  hypothesis  test. 

Also  discussed  in  this  chapter  is  an  experiment  using  University 
student  subjects  only  which  used  "soft"  limiting  of  noise-like  sounds. 
The  finding  that  such  distortion  of  the  time  waveform  of  the  signal 
has  little  effect  on  its  subjective  character  is  noted  and  similarities 
to  speech  distortion  tolerance  is  cited  as  a possible  explanation. 

7.2  Changing  the  Temporal  Structure  of  Sounds 

In  Table  1 are  characterized  the  marine  sounds  used  in  these 
studies.  The  two  sounds,  and  u^,  which  have  a pronounced 

rhythmic  quality,  exhibit  a temporal  shift  in  the  amplitude  with  time. 
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This  amplitude  change,  or  modulation,  is  quasi-per iodic  with  a rate  of 
between  7 and  10  Hz  Since  the  temporal  amplitude  shift  of  speech 
signals  is  known  to  convey  a significant  amount  of  the  information 
about  the  signal,  changes  in  the  temporal  structure  oi  the  marine 
sounds  is  expected  to  change  its  subjective  character  considerably 

Two  experiments  were  conducted  using  both  student  and  sonar 
operator  subjects  where  the  modulation  of  the  sound  was  the  dichotomous 
feature  (ft  ).  An  additional  experiment  using  laboratory  generated 
sounds  was  also  included.  In  this  case,  the  amplitude  modulation 
was  a 10  Hz  square  wave.  These  experiments  are  also  tabulated  in 
Table  2. 

The  tape  recorded  marine  sounds  were  treated  to  eliminate  the 
amplitude  modulation  by  using  a fast  automatic  gain  control.  This 
method,  without  changing  the  overall  spectrum  of  the  sound,  effectively 
eliminates  all  amplitude  modulations  slower  than  about  70  Hz  The 
rhythmic  quality  ct  the  sound  is  effectively  eliminated  by  this 
treatment. 

The  results  of  modified  threshold  tests  using  student  or  sonar 
operator  listeners  are  summarized  in  Table  12.  Sonar  operators  are 
treated  together  e~en  though  the  results  in  Chapter  VI  indicate  that 
there  may  be  some,  significant  differences  between  operators  with  less 
than  two  years  experience  and  highly  experienced  ones.  There  is  not 
sufficient  data  tc  allow  splitting  the  data  by  operator  sub-group. 

The  results  in  Table  12  show  that  this  type  of  experiment  does 
not  seem  to  exhibit  dependence  on  the  exposure  order  as  did  experiments 
where  the  dichotomous  feature  was  a band  of  noise.  The  one  instance 
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where  such  an  effect  may  be  present  is  between  Treatments  VI. 3 and  VI. 4 
but  this  is  of  doubtful  significance  because  of  the  small  number  of 
data  points  associated  with  the  former  treatment  and  because  ct  cne 
doubtful  data  point  at  an  unusually  low  SNR. 

Also,  as  with  results  presented  in  Chapter  VI,  the  sonar 
operator  subjects  responded  at  a significantly  higher  SNR  than  did  the 
student  listeners  This  difference  :.s  about  7 dB.  Subsequent  tests 
using  a graduate  student  to  repeat  the  tests  shew  that  the  results 
obtained  by  the  student  subjects  are  again  attainable.  The  difference 
between  the  two  populations  of  listeners  seems  to  be  cne  of  criterion 
The  sonar  operators  appear  to  be  more  conservative  in  their  terminal 
decisun  Such  differences  in  criterion  should  reflect  in  the 
performance  as  measured  by  the  probability  of  a correct  decision.  In 
Table  13,  the  significant,  probabilities  are  summarized  ter  tests  with 
temporal  features  dichotomous 

The  data  shew  that  the  sonar  operators'  performance,  if  anything, 
tends  to  be  somewhat  less  reliable  than  the  students’  Ihe  large 
confidence  intervals  preclude  disproving  the  null  hypothesis,  ..  e , 
there  is  no  difference  between  populations,  at  the  0 1 le  el,  however. 
The  value  of  P(C)  exhibited  by  fhe  students  in  ExpenmenT  V is  high 
and  the  sonar  operator  data  is  somewhat,  tainted  by  the  iac  that  seme 
cf  the  less  experienced  operators  exhibited  a surprisingly  high  e.rcr 
rate  cn  this  experiment  There  is  not  enough  data,  however,  to  allcw 
for  definite  conclusions.  It  is  noteworthy,  however,  rhat  two  ct  the 
mest  experienced  sonar  operators  made  no  errors  cn  ary  ct  rte  events 
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where  the  dichotomous  feature  was  the  amplitude  modulation  or  the 
signal.  These  same  operators  were  very  conservative  in  their 
responses,  however,  and  tcutmely  responded  at  high  values  of  SNR 

7 . 3 Analysis  of  Results  Obtained  in  Experiments  with  the 

Temporal  Characteristic  Modified 

In  the  experiments  with  signals  treated  to  remove  amplitude 
modulation  vs  the  recorded  marine  sound,  the  dichotomous  feature  was 
the  presence  of  amplitude  modulation  (fi  ) . This  feature  appears 

Arl 

as  a repeated  burst  of  noise  impressed  upon  a steady  background. 

Miller  and  Taylor  (Miller,  1948;  Miller  and  Taylor,  1948;  investigated 
the  subjective  character  of  this  type  of  signal.  They  found  that  the 
differential  threshold  for  intensity  AI/I  increases  as  the  duration 
ct  the  added  burst  of  noise  decreases.  In  the  limit  where  the  duration 
cf  the  added  increment  exceeds  about  250  msec,  the  performance  is  given 
fcy  the  Weber  fraction 

cl/ I » 0.1,  t > 250  msec  (Green,  1960a)  , 

10  log  Al'I  : -10  dB  . 

For  the  marine  sounds  in  question,  the  natural  modulation 
corresponding  to  feature  is  in  the  term  cf  short  bursts  cf  noise 

which  are  repeated  mere  cr  less  periodically  at  a rate  of  7 to  10  Hz. 
Ibe  duration  zi  the  noise  pulse  is  between  25  and  50  msec  but  with  a 
non- rectangular  wavetorm.  Figures  11  and  12  show  that  the  eftective 
bandwidth  of  the  signal  vs  noise  background  is  approximately  6 kHz 
from  400  Hz  tc  6400  Hz  at  the  3 dB  down  points.  The  Weber  fraction 
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is  then  the  incremental  change  in  intensity  vs  average  intensity 


just 

perceptible. 

Mcore  and 

Raab  (1975) 

find  that  the  Weber  traction 

is: 

a. 

-1.4 

dB 

for 

3160 

Hz 

bandwidth 

and  duration 

10  msec . 

b . 

-0.9 

dB 

for 

1000 

Hz 

bandwidth 

and  duration 

10  msec . 

c 

-5.5 

dB 

tor 

3160 

Hz 

bandwidth 

and  duration 

250  msec  - 

d 

-4.9 

dB 

fcr 

1000 

Hz 

bandwidth 

and  duration 

250  msec . 

e. 

-7.3 

dB 

for 

an  18  kHz  bandwidth  and  duration  250  msec. 

For  Experiment  V,  the  central  12.5  msec  duration  of  the  peak  in 
amplitude  was  measured  to  be  4 dB  relative  the  average  signal 
amplitude.  Using  this  fact,  and  the  results  shown  in  Table  12,  the 
following  can  be  calculated: 

a.  The  average  signal-plus-noise  level  relative  to  the 
background  noise  alone  was  1.65  dB. 

b.  The  point  estimate  of  signal-plus-noise  level  relative  to 
the.  background  during  the  peak  of  the  envelope  amplitude 
excursion  was  3.25  dB. 

c.  The  ratio  cf  peak  signal-plus-ncise  ratio  to  average  signal- 
plus-noise  ratio  was,  therefore,  observed  to  be  1.6  dB. 

The  peak  amplitude  was  measured  using  the  transient  capture 
mode  cf  a real-time  spectrum  analyzer.  This  peak  amplitude  was  then 
compared  tc  the  long-term  average  which  included  both  the  high 
amplitude  and  low  amplitude  signals. 
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After  normalizing  the  average  intensity  to  unity,  the  findings 
above  show  that  the  ratio  of  intensities  at  the  peak  of  the  amplitude 
to  the  average  is: 

(I  + AI)/I  = 1.45/1.0 
or 

AI/I  = 0.45/1.0 

10  log  (AI/I)  = -3.5  dB  . 

Correspondingly,  the  confidence  intervals  are  given  by: 

-4. 3 < AI/I  < -3. 3 dB  . 

When  this  result  is  compared  to  those  of  Moore  and  Raab  (1975), 
it  is  seen  that  the  present  observed  SNR  to  respond  agrees  reasonably 
well  The  pulses  ccmprising  the  modulation  fall  between  these 
investigated  by  them  and,  therefore,  cannot  be  directly  compared  with 
the  present  results.  However,  extrapolating  their  results  using  the 
empirical  method  proposed  by  them,  the  predicted  value  of  AX/ 1 would 
be  about  -40  dB.  The  difference  between  this  predicted  value  and 
the  results  obtained  using  the  modified  threshold  procedure  is 
certainly  net  unreasonable  considering  the  simplifying  assumptions- 
Hence,  as  for  the  classification  of  the  high  pass  filtered  signal, 
the  results  observed  can  be  explained  reasonably  well  by  simply 
assuming  that  the  problem  is  one  of  detecting  a dichotomous  feature. 

It  must  be  noted,  however,  that  treating  these  pulse-like  increases 


j 


| 
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in  the  amplitude  as  isolated  pulses  to  be  detected  in  a noise  background 
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must  be  done  with  caution.  These  pulses  occur  often  enough  sc  that 
they  are  approaching  an  indistinguishable  series  of  pulses  as 
investigated  by  Miller  and  Taylor  (1948).  They  state  that  the 
critical  frequency  is  about  20  Hz  and  is  also  a function  of  duty 
factor 

In  the  absence  of  amplitude  modulation,  the  listener  is  faced 

with  the  task  of  first  detecting  the  presence  of  the  unmodulated  band 

of  noise  against  the  background.  This  detection  is  analogous  tc  the 

case  treated  in  Chapter  VI  where  now  W is  about  6000  Hz 

ef  f 

(Experiments  VIII,  IX,  and  X).  Given  the  point  estimate  cn  SNR  shewn 
in  Table  12  and  the  1/3  octave  calibration  data,  the  effective  signal- 
tc-ncise  ratio  is 

S/N  = -4.8  dB  , 
which  yields  a detectability  of 

10  log  'opt  * 10.7  . 

This  detectability  compares  favorably  to  that  measured  for  the 
experiments  where  bands  of  noise  were  dichotomous  (See  also  Figure  16). 
In  this  case,  however,  the  difference  between  the  SNR  tc  detect  the 
unmodulated  noise  and  the  SNR  to  make  the  terminal  decision  does  net 
appear  to  be  as  gteat  as  that  needed  in  Experiment  VIII,  although 
there  is  sufficient  uncertainty  about  the  effective  bandwidth  that  a 


statistical  test  is  not  warranted. 
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For  Experiment  VI,  the  peak  amplitude  relative  the  long  term 
average  was  measured  to  be  about  2.0  dB  although  the  natural  noisiness 
of  the  envelope  makes  this  quantity  a difficult  one  to  measure.  From 
Table  12,  Treatments  VI. 1 and  VI. 2 the  average  SNR  is  2.78  dB,  hence, 
the  average  signal-plus-noise  to  noise  ratio  4 63  dB  and  the  peak 
signal-plus-noise  to  noise  ratio  is  6.03  dB.  Therefore,  the  peak 
signal-plus-noise  ratio  is  1.40  and 

(I  + AI)/I  = 1.38/1.0 
or 

AI/I  = .38/1  0 

10  log  (AI/I)  = -4.2  dB  . 

This  result  is  again  near  that  extrapolated  from  the  earlier 
experimental  evidence.  For  this  signal,  however,  the  effective 
duration  cf  the  peak  in  envelope  amplitude  is  more  variable  than  that 
for  Experiment  V,  and  there  is  a considerable  uncertainty  about  the 
value  of  pulse  duration  to  use.  However,  the  results  are  encouraging 
When  the  amplitude  modulation  is  absent,  the  SNR  to  respond  is,  from 
Table  12,  abcut  5.9  dB  when  data  for  all  sonar  operators  is  pooled. 
This  result  yields  a detectability  of 

10  log  d'  _ = 13.8 
opt 

which  is  considerably  higher  than  was  noted  for  other  experiments 
dealing  with  the  detection  of  bands  of  noise.  This  result  may  be  due 


to  some  residual  periodic  change  in  the  spectrum  of  the  signal,  after 
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eliminating  the  amplitude  modulation,  which  is  heard  as  a modulation 

of  the  frequency  structure  of  the  signal.  However,  a difference  in 

the  classification  decision  rules  could  also  be  active  and  indeed  this 

does  seem  to  be  the  case  in  the  experiment  discussed  below. 

In  Experiment  VII,  the  uncertainties  associated  with  the  marine 

sounds  are  eliminated,  The  sound  pattern  in  this  case  consists  of  two 

octave  bands  of  stationary  noise  where  the  lower  frequency  one  is 

either  amplitude  modulated  by  a 10  Hz  square  wave  or  is  not.  For  this 

case,  feature  has  an  effective  duration  of  50  msec  and  the  peak 

4AM 

in  spectral  level  is  3 dB  higher  than  the  average  for  the  modulated 
band . 

Pooling  data  in  Table  12,  the  required  SNR  when  the  modulated 
feature  is  present  is  1,92  dB.  From  the  calibration  data  and  this 
observed  mean  signal-' o-noise  ratio  to  respond,  the  band  level  at 
the  response  was  4,7  dB  relative  to  the  background.  The  observation 
yields  the  following  measures: 

10  log  [ ( S + n)/n]  = 6.0  , 

where  S is  the  average  signal  intensity,  and 

10  iog  [ (S  ■+■  n/N)]  = 8.4  , 

where  is  the  peak  value  of  signal  intensity,  hence, 

(I  + AI)  / 1 = 1.73/1.0 
or 


10  log  (AI/ I ) = -1.37  . 
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Hence,  for  Experiment  VII,  the  observed  intensity  discrimination 
performance  was  -1.4  dB  for  354  Hz  bandwidth  and  duration  50  msec. 

This  result  also  falls  within  the  anticipated  performance  bounds 
determined  from  data  measured  by  Moore  and  Raab  (1975) . 

When  the  amplitude  modulation  feature  is  absent,  the  problem 
reduces  to  detecting  the  band  of  noise  centered  at  500  Hz  and  then 
deciding  that  the  amplitude  modulation  is  absent.  Comparing  Tables 
11  and  12,  it  is  seen  that  the  decision  in  favor  of  H^  when  the 
dichotomous  feature  was  amplitude  modulation  is  made  at  a SNR  which 
is  some  6 to  8 dB  higher  than  when  the  dichotomous  feature  was  the 
low  frequency  band  of  noise  itself.  The  difference  between  these 
two  sets  of  data  clearly  demonstrates  that  the  listener  alters  the 
decision  rules  in  response  to  the  task  to  be  performed.  The 
detectability  of  the  unmodulated  band  of  noise  is  physically  the 
same  for  Treatments  IX. 1 and  IX. 2 as  it  is  for  Treatments  VII. 3 and 
VII. 4 but  the  response  level  is  significantly  different.  The 
difterence  seems  to  be  due  to  the  fact  that  the  listener  must  obtain 
sufficient  information  to  decide  that  the  band  of  noise  being 
attended  to  is  both  present  and  not  amplitude  modulated. 

7-4  The  Effect  of  Distortion  of  the  Signal  Waveform 

To  test  the  effect  of  limiting  on  signals,  the  tape  recorded 
signal  was  passed  through  a non-linear  circuit  with  the  transfer 
characteristic  shown  in  Figure  18.  The  input  level  was  so  adjusted 
that  limiting  began  at  the  la  point  of  the  time  waveform.  Such  a 
transfer  function,  while  modifying  the  detailed  amplitude  vs  time 
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Figure  18.  Time  Domain  Transfer  Function  of  a Limiting  Circuit 
Used  to  Vary  the  Temporal  Structure  of  the  Sound. 


structure  of  the  signal,  does  not  greatly  reduce  the  intelligibility 
of  speech  (Licklider , Bindra,  and  Pollack,  1948). 

Five  treatments  using  two  marine  source  recordings  were 
investigated  using  student  subjects.  Of  the  53  events,  only  18 
resulted  in  a terminal  decision  and,  of  these,  55  percent  were  correct 
classifications.  The  treatments  and  number  of  events  tested  in  each 
case  are  summarized  in  Table  14.  The  responses  and  listener  comments 
indicate  that  there  is  little  subjective  difference  in  the  sounds 
attributable  to  the  limiting  process.  Those  responses  which  were  made 
occurred  at  a SNR  near  the  maximum  presented.  At  this  high  SNR,  the 
signals  are  little  contaminated  by  the  background  noise  and  the  task 
is  one  oi  matching  an  essentially  pure  probe  to  the  exposed  signal  set. 
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TABLE  14 

SUMMARY  OF  TREATMENTS  TO  INVESTIGATE  THE 
EFFECT  OF  la  LIMITING  OF  A MARINE  SOUND 


Treatment 
No . 

Exposure  Sequence 
A B 

Probe 

No.  of  Eventsl 
Tested 

XI.  1 

W2 

- ~ 

w21a 

oj2 

' ' ' 1 

11 

XI.  2 

W2 

21a 

W21  a 

U 

Xll.l 

W3 

w31a 

"3 

10  i 

X11.2 

W3 

w3la 

“31a 

n | 

X11.3 

w31o 

w3 

w31a 

O 

L 

la 


is  the  signal  pattern  limited  at  the 


point . 


CHAPTER  VIII 


CRITERION  AND  RESPONSE  BIASES 

8 . I Older  and  Training  Effects  vs  Criterion 

The  results  presented  in  Chapters  VI  and  VII  demonstrated  that 
the  dara  are  influenced  by  the  experimental  technique.  In  addition, 
that  data  hinted  at  a significant  difference  in  the  way  sonar 
operators  and  trained  University  student  listeners  perform  in  the 
discrimination  task. 

The  data  is  reviewed  in  this  chapter  in  an  effort  to  identify 
the  effect  various  aspects  of  the  experimental  technique  may  have  on 
the  outcome  cr  the  experiment.  The  effect  of  time  into  the  response 
period  will  be  investigated.  It  is  seen  that  one  difficulty  in  using 
threshold  schemes  such  as  the  "modified  threshold  procedure"  is  that 
it  is  difficult  to  separate  the  dependence  of  response  SNR  on  the 
detectabiliry  of  the  dichotomous  feature  from  time  dependent  shifts 
it  criterion  or  from  other  tactors  which  have  a time  dependence.  The 
results  item  tests  designed  specifically  to  test  the  effect  of  time 
are  preserved  in  Secticn  8.2. 

Then,  in  otdet  to  explicitly  investigate  the  role  of  criterion 
on  the  outcome  of  discrimination  tasks,  the  results  of  slide  wire 
experiments  are  described  in  Section  8.3.  These  studies  allow  the 
mapping-out  of  performance  as  measured  fn  terms  of  P(C)  and  P(W),  the 
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probabilities  cl  a correct  or  wrong  decision,  as  the  detectability 
changes. 


8 . 2 Experimental  Variable  and  Response  Interaction 

In  earlier  chapters,  it  was  noted  that  there  is  an  order  effect 
which  seems  to  be  related  to  the  sequence  of  exposure  of  the  training 
signal  set  The  bias  was  observed  in  several  of  the  experiments 
involving  high  pass  liltering  of  the  sound  or  other  sounds  for  which 
the  dichotomous  learure  is  a low  frequency  band  of  noise,  and 
indicated  the  following  trends: 

a.  The  order  oi  exposure  A,  B followed  by  the  probe  A gave 
higher  values  cf  SNR  than  the  order  of  exposure  B,  A 
t llowed  by  the  probe  A. 

b-  The  order  of  exposure  A,  B followed  by  the  probe  B gave 
higher  values  cf  SNR  than  the  order  of  exposure  B,  A 
followed  by  the  probe  B. 

Ncte  that  the  significant  factor  was  not  whether  the  prcbe 
immediately  followed  the  corresponding  exposure  signal.  The 
determining  factor  seemed  to  be  in  which  order  the  exposure  signals 
were  presented.  In  these  cases,  A was  the  signal  pattern  containing 
the  dichotomous  feature,  and  B was  the  one  with  it  missing. 

The  explanation  tor  this  effect  is  not  at  all  clear  since  it  is 
difficult  to  conceive  of  a memory  organization  for  the  data  which 
wcuid  tav..r  this  type  of  behavior.  This  is  certainly  an  effect  which 
roust  be  contended  with  in  such  experiments  and  one  which  should  be 
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better  understood.  An  understanding  of  the  mechanism  responsible 
wculd  strongly  influence  subsequent  experiments  using  the  modified 
threshold  procedure. 

Another  consideration  m this  type  of  experiment  is  that  of  the 
time  factor.  In  studies  of  auditory  memory  span,  it  is  routine  to 
present  an  exposure  signal  followed  by  a shadowing  stimulus  prior  to 
the  probe  presentation  (Parkinson,  Parks,  and  Kroll,  1971),  The  effect 
of  the  shadowing  stimulus  is  quite  pronounced  when  tests  are  done  with 
isolated  letters  or  phonemes  m a non-rehearsal  situation.  The  low 
SNR  presentation  at  the  beginning  of  the  response  period  may  be 
considered  a shadowing  stimulus  and  this  may  lead  to  an  increase  in 
SNR  to  respond  as  time  to  respond  increases.  The  experimental  design 
includes  randomization  of  the  starting  SNR  to  reduce  this  effect  as 
well  as  to  avoid  listeners  becoming  conditioned  to  respond  after  a 
certain  time,  However,  the  SNR  and  time  are  still  strongly  ccr related. 
If  the  initial  SNR  were  fixed,  the  correlation  wculd  be  nearly  exact 
since  the  SNR  increases  with  time. 

The  design  of  the  experiment  was  such  that  the  starting  SNR 
values  cct.ld  take  on  any  one  of  five  Integer  dB  values  and  these  were 
■'hcser.  at  random  using  a uniformly  distributed  random  number  table. 

In  the  case  where  'he  SNR  increases  linearly  at  1/4  dB  per  second, 
the  correlation  coefficient  relating  SNR  to  time  is 

p - o^/  /oj  + 26.67  , (8.1) 


t 1 
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where  c is  the  -tandard  deviation  of  the  observed  time  to  respond  ^ 
Thi~  cot -elation  function  was  derived  by  noting  that  the  starting  SNR 
is  independent  oi  the  time  to  respond.  There  are,  therefore,  two 
independent  random  variables,  and  the  SNR  is  a function  of  both. 

When  the  standard  deviation  on  SNR  to  respond  is  Sr..m  with  a 

SNR 

rypical  value  ot  2.5,  the  value  of  will  be  approximately  10.  For 

this  value,  Equation  (8  1)  yields 

p = i / c*  + 26.67  = 0 89  when  O =10 

t t t 

This  is  a high  value  of  correlation  and  suggests  the  difficulty 
cf  separating  SNR  dependence  from  factors  which  effect  t directly. 

I'  was  ncted  that  sonar  operators  participating  in  the  modified 
thresh' Id  procedure  were  noticeably  disturbed  whenever  they  failed  to 
respond  m an  event-  The  observed  number  of  non-respcnse  events  was 
very  small  ter  this  group  of  subjects  as  a result.  It  appears, 
therefore,  that  these  subjects  were  assigning  a high  cost  to  deferring 
the  de  islcn  once  a certain  time  had  elapsed  during  the  response  period. 
Pric-  tc  r hat  time,  they  knew  that  the  likelihood  of  their  having  time 
re  respond  was  high  However,  as  time  passed,  they  were  mere  likely 
to  be  denied  an  opportunity  to  respond,  and  the  cost  for  the  deferral 
decision  would,  therefore,  change  in  their  view.  Such  a shift  of 
criterion  with  time  occurs  in  many  real-world  discrimination  tasks  as 
well  and  will  ccntcund  the  effort  to  measure  pertormance  ar  a fixed 


the  ccrrelattcn  function  was  derived  by  Mr.  D.  W.  Martin,  a tellcw 
graduate  student  assigned  to  support  data  reduction  of  related  sonar 
operator  data 


criterion  such  as  is  implied  in  the  instruction  "reasonably  certain  " 
loeb  and  Bintord  (1970)  investigated  some  of  the  dependencies  with 
time  ot  monitoring  tasks  although  their  time  scales  were  considerably 
longer  and  there  may  not  be  much  relationship  to  the  present  study. 

It  is  reasonable,  however,  to  conjecture  that  some  shift  in  criterion 
will  occur  as  the  response  period  progresses. 

The  large  correlation  between  response  time  and  SNR  will  result 
in  a downward  shift  in  SNR  due  tc  the  tendency  on  the  part  of  the 
listeners  to  lower  their  time  to  respond.  There  seems  to  be  no 
totally  acceptable  way  to  eliminate  the  shift  in  criterion  with  time 
since  making  the  response  time  itself  longer  is  also  undesirable 
because  the  listeners  become  restless  during  the  period  of  time  during 
which  the  signal  is  blanked.  Even  making  the  response  times  much 
lcr.ger  does  n&f  insure  that  subjects  will  not  shift  their  criterion  as 
is  shewn  by  Loeb  and  Bintord  (1970)  for  listening  sessions  lasting 
hour  s , 

To  investigate  the  effect  cf  time  and  shadowing  in  greater  detail, 
an  additional  experiment  (Experiment  XIII)  was  performed  using  the 
mcdilied  threshold  procedure.  For  this  test,  an  additional  30-seccnd 
delay  was  introduced  between  the  start  ci  the  response  period  and  the 
onset  of  ramping  ct  the  SNR.  Since  this  period  corresponded  tc  an 
essentially  noise-cnly  situation,  the  stimulus  during  this  time  served 
as  a masking  cne.  Events  for  which  this  additional  delay  was  present 
were  mixed  at  randem  with  those  where  the  ramp  started  immediately  and 
the  subjects  were  net  aware  of  which  events  would  have  this 
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characteristic-  Only  sonar  operator  subjects  were  used.  Unfortunately, 
these  added  events  corresponded  to  Experiment  IX  for  which  the  data 
exhibited  a pronounced  dependence  on  exposure  order.  When  data 
comparable  to  Treatment  IX. 4 was  used  as  a comparison,  however,  the 
results  indicated  that  the  addition  of  the  added  delay  reduces  the 
SNR  to  make  the  terminal  decision  significantly.  The  T statistic 
was  in  this  case 


1 = -3.19  fcr  Experiment  XI  vs  Experiment  IX 
Treatment  4 with  19  degrees  of  freedom 

This  value  of  T is  significant  at  the  0. 01  level.  Other  treatments 
ot  Experiment  IX  could  net  be  compared  because  of  insufficient  data. 
Also,  pooling  c-i  data  was  not  possible  because  of  the  exposure  order 
ei tec t . 

This  resul'  dramatically  demonstrates  the  effect  of  listener's 
shitting  their  criterion.  Not  knowing  that  this  event  would  last 
lenge  *han  ethers,  they  chose  to  respond  at  a lower  contidence  rather 
than  rc  possibly  not  respond  at  all.  In  retrospect,  the  design  of 
ths  experiment  pre-ordained  the  outcome.  However,  when  these 
experiments  *ere  designed,  there  was  no  reason  to  suspect  that 
liStene's,  especially  trained  senat  operators,  would  find  it  so 
ditficult  to  maintain  a criterion  stressed  repeatedly  in  the 
mst t uc t ions  to  the  subjects.  Feedback  in  such  tests  would  perhaps 
enter  e a subject’s  criterion  although  Robinson  and  Watson  (1972) 
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report  that  for  highly  trained  listeners,  feedback  might  actually 
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be  detrimental  Prior  knowledge  by  the  subject  of  which  events  have 
an  additional  shadowing  period  may  also  be  beneficial  since  the 
subjects  could  then  appropriately  adjust  their  concept  of  how  long 
they  have  till  the  response  period  may  end. 

Another  experimental  characteristic  unique  to  sonar  operator 
tests  was  that  the  exposure  signals  were  the  same  for  six  consecutive 
events-  Beth  the  signal  patterns  and  exposure  order  were  held  fixed 
and  the  exposure  set  was  only  presented  briefly  prior  to  another  event 
once  the  set  had  been  trained.  The  listeners,  therefore,  had  six 
opportunities  to  refresh  their  memory  of  the  signal  patterns.  In 
order  to  test,  for  learning  effects  within  such  a group  of  events,  SNR 
to  respond  was  compared  between  the  first  event  of  a group  and  the 
last,  between  the  first  and  fifth,  etc.  A statistically  significant 
difference  was  round  between  the  first  event  of  a group  and  all  others 
m Experiment  III  only.  This  finding  was  true  for  both  low  experience 
and  high  experience  operators  It  is  possible  that  the  marine  sound 
in  this  ra-e,  having  little  amplitude  modulation  or  other  readily 
discernible  characteristic,  was  very  nolse-like  and  was  easily 
confused  v r h the  ocean  ambient  sound  against  which  the  signal  pattern 
was  presented  in  the  response  period.  Lacking  a recently  reinforced 
men*  a I lm-ge  of  the.  background  sound,  the  first  event  resulted  in  false 
c lass i t lcat ions  at  low  values  of  SNR.  Subsequent  events,  however,  were 
tested  in  a situation  where  the  refreshed  signal  patterns  cculd  be 
mentally  compared  to  a rather  recent  exposure  to  the  background.  The 
subjects  did,  in  fact,  comment  that  they  would  have  liked  to  be  able 
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to  recall  the  background  noise  at  will  for  purposes  of  comparison. 

The  pre-recorded  nature  of  the  signal  sequences  precluded  such  a 
retlnemsnt,  however. 

The  above  described  intra-group  effect  does  not  explain  the 
previously  mentioned  dependence  on  exposure  order.  It  is  conceivable 
that  the  ordering  cf  events  within  a group  could  have  favored  certain 
comb inar ions  cf  exposures  and  probes  at  the  beginning  of  the  group  and 
ether  combinations  at  the  end.  However,  when  the  first  event  of  a 
group  is  eliminated  from  consideration,  the  exposure  order  effect 
persists  Also,  the  exposure  order  effect  occurred  in  most  experiments 
with  a band  of  noise  dichotomous,  whereas  the  intra-group  effect  was 
significant  in  only  Experiment  III. 

The  possible  effect  of  operator  fatigue  was  also  investigated. 
Since  sessions  consisted  of  18  to  24  events,  events  occurring  at  the 
end  ct  a session  may  be  biased  by  tiring  of  the  subject.  However, 
no  statistically  significant  difference  was  found  between  like  events 
associated  with  any  experiment  which  occurred  at  the  beginning  of  the 
session  compared  with  those  occurring  at  the  end  of  a session. 

Similarly,  no  inter-group  dependence  could  be  established  by  application 
of  statistical  hypothesis  tests  on  the  SNR  to  respond 

Based  cn  the  above  discussion,  the  effects  induced  by  the 
experimental  design  are: 

a There  is  a consistent  and  significant  effect  of  order  of 

exposure  for  tests  involving  dichotomous  bands  of  noise,  but 
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seemingly  not  for  other  signal  patterns. 
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b.  The  SNR  to  respond  is  strongly  correlated  to  response  time 
and  the  latter  is  influenced  by  an  apparently  strong  aversion 
to  letting  the  event  terminate  without  responding. 

c.  Uncertainty  about  the  sound  of  the  background  noise  in  which 
the  probe  is  presented  can  cause  the  first  event  in  grouped 
data  to  give  anomalous  results.  This  problem  is  apparently 
confounded  by  the  fact  that  two  similar  but  not  identical 
background  ncises  were  used  in  these  tests. 

d.  The  experimental  results  do  not  seem  to  depend  on  the 
position  of  an  event  within  a session. 

e.  The  effects  of  a masking  stimulus  or  of  short  term  memory 
decay  were  not  measurable  by  the  technique  used.  However, 
revisions  of  the  method  may  allow  for  measuring  these 
effects . 


8. 3 Results  Using  Continuous  Rating  Methods 

In  order  to  study  the  effect  of  criterion  in  mere  detail,  three 

experiments  using  the  slide-wire  technique  were  conducted  using  Navy 

personnel  only  Each  experiment  consisted  of  six  events  of  grouped 

data  as  previously  described.  Of  the  data  thus  collected,  only  the 

last  set  ct  data  using  the  sound  patterns  u)  and  u)ou_n  will  be 

2 2tiJU 

discussed  below-  The  other  data  was  obtained  after  the  subjects  were 


initially  interviewed  and  were  still  in  the  process  of  adjusting  to 


the  situation. 
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The  signal  patterns  and  order  of  the  exposure  set  corresponds  to 

Treatments  11  5 and  II. 6 listed  in  Table  7.  That  is,  the  signal 

patterns  were  presented  in  the  order  co  , oo  with  oj.  or  w 

Z ZtijU  Z ZHJU 

serving  as  the  probe  with  the  ratios  of  occurrence  of  the  two  being 
4 to  2,  respectively  A total  of  70  useful  events  were  collected 
using  both  groups  oi  operators,  those  with  extensive  experience  and 
those  with  relatively  little. 

The  data  was  output  to  a multi-channel  strip  chart  along  with  a 
voltage  proportional  to  the  SNR  set  on  the  balanced  mixer.  As  the 
listeners  adjusted  their  classification  and  degree  of  certainty,  a 
voltage  between  0 and  5 volts  resulted.  In  the  neutral  position, 
centered,  this  voltage  was  2.5  volts  and  decreased  with  increasing 
Com idence  A classifications  and  increased  with  increasing  Confidence 
B classifications- 

In  these  tests,  t.he  subjects  were  instructed  to  attempt  to  make 
a ' lassif icatlon  as  seen  as  possible  and  to  feel  free  to  change  their 
mind  See  also  Appendix  A where  the  text  of  the  instructions  is  shewn. 
The  subjects  did  in  iacf  revise  early  opinions  in  a number  of  instances 
as  mere  dota  became  available.  Prior  to  taking  data,  the  listeners 
wete  instructed  to  indicate  what,  in  their  opinion,  constituted  a 
decision  of  "reasonable  certainty"  and  this  rating  was  near  8 or 
higher  for  all  listeners.  Parallax  in  setting  the  slider  of  the  linear 
digit  pot ent lemet er  resulted  in  differences  of  + 1 on  the  rating  scale 
at  the  renrer  point  depending  on  the  subject.  This  parallax,  and 
carscness  of  the  rating  scale  contributed  a hysteresis  in  the  output 
of  about  one  digit. 


In  analyzing  the  data,  the  tentative  decision  was  noted  after 
every  2-dB  change  in  balanced  mixer  setting.  Furthermore,  decisions 
wete  scored  as  below: 

a Confidence  ratings  between  2A  and  2B  were  considered  as  no 
response . 

b.  Confidence  ratings  between  2 and  6 in  either  the  A or  B 
directions  were  noted  as  low  confidence  classification 
decision  of  A or  B,  respectively. 

c.  Confidence  ratings  exceeding  6 were  treated  as  high 
confidence  decisions  of  A or  B. 

Subsequent  analyses  of  the  results  considered  low  confidence 
answers  as  these  for  which  the  listener  indicated  either  a low 
confidence  or  a high  one,  i.e,,  for  which  the  subject  was  at  least 
minimally  committed  tc  an  A or  B classification  For  a high  confidence 
answer,  the  listener  had  tc  have  indicated  a confidence  level  of  6 
or  mere  If  lew  confidence  answers  are  used,  a greater  number  of 
both  correct  and  incorrect  classifications  are  expected  than  when  only 
high  confidence  classifications  are  admitted. 

Classifications  were  grouped  into  two  criteria  even  though  a 
'ont mucus  rating  scale  is  used.  This  grouping  is  made  because  a 
number  of  listeners  will  probably  net  apply  criteria  in  consistent 
enough  a fashion  to  allow  pooling  ot  data  which  is  resolved  into  mere 
categcries  Finer  categories  are  possible  for  a single  listener, 
tut  if  is  expected  that  different  listeners  assign  various  meanings 
to  numeric  ratings  (Robinson  and  Watson,  1972). 
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The  data  obtained  from  slide-wire  tests  is  a classification  which 
is  a function  of  the  SNR  and  the  criterion.  When  the  listener's 
rating  is  less  ceitain  than  the  specified  criterion,  no  matter  what 
his  tentative  decision  (A  or  B) , he  can  be  thought  of  deferring  his 
decision  as  described  in  Chapter  II.  It  is  assumed  that  a listener 
in  this  case  would  also  not  commit  to  a terminal  decision  which  he  had 
previously  been  instructed  to  make  only  when  a certain  criterion  was 
satisfied  The  present  subjects  indicated  that  a rating  of  about  8 
ccrresponds  to  the  verbal  instruction  "reasonably  certain"  during  the 
calibration  phase  of  the  session. 

The  observed  probabilities  of  various  event-response  categories 
are  related  to  the  detectabilities  of  the  signal  patterns  (SNR's)  and 
to  the  criterion  (Egan,  1975).  These  probabilities  define  the 
receiver  operating  characteristics  (ROC's)  in  these  classifications. 
Figure  39  shows  the  salient  features  of  an  ROC  curve  for  normal- 
normal  distributions.  The  isocriterion  contours  correspond  to  a 
listener  responding  to  signal  patterns  at  various  detectabilities 
while  maintaining  the  same  decision  criterion.  Shifts  of  criteria 
at  lixed  detectability  result  in  translation  parallel  to  the  chance 
diagonal  trom  one  isccriterion  contour  to  another.  In  order  to 
visualize  the  various  effects,  consider  the  case  where  the  hypotheses 
and  are  equally  likely.  Then,  whenever  the  criterion  is: 

Respond  either  uj  or  u)  in  response  to  the  stimulus. 

• • H • • 

L 


P( FA ) 


Figure  19.  Salient  Features  of  Receiver  Operating  Characteristic 
(ROC)  Curves  for  Normal-Normal  Distributions. 
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This  is  a lax  criterion  indeed  since  the  response  is  made  without 
necessarily  any  confidence.  When  the  detectability  cf  the  signal  is 
vanishingly  small,  the  probability  of  correctly  reporting  the  feature 
present,  i.e.,  P(S  |H^)  is  equal  to  the  probability  of  incorrectly 
reporting  the  feature  present  (P(w  |Hq)).  Since  there  will  be  a 
response  to  each  inquiry,  the  respective  probabilities  are  each  0.5 
since  the  feature  is  either  present  or  not  present  with  equal 
probability.  In  the  figure,  these  probabilities  are  marked  P(D)  and 
P(FA)  to  conform  to  common  practice.  As  the  detectability  increases, 
the  relative  probabilities  change  until  there  is  no  longer  any 
uncertainty,  and  the  presence  of  the  feature  is  always  detected  and 
its  absence  never  results  in  a positive  report. 

Similarly,  when  the  criterion  is: 

Respond  either  u)  cr  fi  whenever  reasonably  certain, 

both  probabilities  P(D)  and  P(FA)  will  be  low  but  equal.  As  the 
detectability  increases,  a listener  who  wishes  to  maximize  his 
performance  in  terms  of  P(D)  will  shift  the  absolute  value  of  the 
threshold  to  accept  more  false  positive  reports  while  increasing 
the  relative  probability  of  cc.rect  positive  reports  (Egan,  1975).  As 
the  detectability  increases  further,  the  value  of  P(FA)  will  start 
to  decrease  as  P(D)  centimes  to  increase. 

In  the  discrimination  task,  the  performance  indices  are  P(C), 
the  probability  cf  a correct  response,  and  P(W),  the  probability  of 


r 


an  incorrect  response. 


The  probability  of  a correct  response  is  given  by 


P (C)  = P(oi2|H1)  P(H:)  + P^2H30|H0)  P(Hq)  , (8.2) 


where 


P(6j 


2H30 


P(uS  |H1)  is  the  probability  cf  a correct  acceptance, 
|Hq)  the  probability  cf  correct  rejection,  and 

POy  = 2P(HQ),  P(cu2)  = 2P  (Uj2h3q)  = -66 


are  the  prior  probabilities  of  the  two  signal  patterns. 
Similarly, 


P(w)  = P(w2;H0)  P(H0)  + P(^2H3oIH1)  P(H1)  * (8.3) 

Note  that 


P(C)  -t  P(W)  ^ 1.0 

since  these  are  in  fact  conditional  probabilities  conditioned  on  the 
criterion  having  been  exceeded.  The  probabilities  gi-en  by  Equations 
(8.2)  and  (8.3)  may  serve  as  the  variables  when  plotting  the  ROC  curve 
and  will  exhibit  the  same  dependence  on  d’  and  criterion  as  do  P(D) 
and  P(FA)  in  the  detection  problem. 

Figures  20  and  21  show  the  experimental  behavior  cf  these 
probabilities  with  increasing  SNR.  figure  20  is  for  the  case  where  a 
strict  criterion  applies,  and  Figure  21  shows  the  results  cf  accepting 
tentative  decisions  which  satisfy  the  lew  confidence  criterion  or  the 
high  confidence  one  The  data  includes  the  90  percent  confidence 
intervals  on  the  estimate  cf  underlying  probability  from  the  observed 


performance.  Note  that  the  two  sets  ot  data  seem  tc  follow  the 
behavior  predicted  for  a listener  attempting  to  maximize  his  P(C). 

In  order  to  more  clearly  show  the  erfect  of  criterion,  the 
results  corresponding  tc  an  SNR  of  1.0  and  11.4  are  plotted  in 
Figure  22  for  the  two  criteria.  The  arrows  show  pure  translation  of 
the  point  estimates  of  P(C)  and  P(W)  parallel  to  the  chance 
diagonal  as  would  be  expected  by  a relaxation  of  the  criterion.  The 
observed  behavior  with  these  subjects  is  well  within  bounds  at  the 
90  percent  confidence  level. 

For  the  modified  threshold  procedure,  the  Navy  personnel 
responded  at  an  SNR  of  6.46  when  data  shown  in  Table  7 is  pooled 
The  criterion  in  this  case  was  "reasonably  certain."  and  this 
corresponds  to  ratings  in  the  slide-wire  tests  of  about  eight. 
Therefore,  the  data  collected  m Lhe  slide-wire  experiment  under  the 
strict  response  is  comparable  tc  the  modified  threshold  data  collected 
at  the  terminal  decision  point.  In  Figure  21,  the  approximately 
equivalent  SNR  data  is  shown  with  an  arrow  Note  that  this  datum 
occurs  at  the  point  where  the  PIC)  begins  tc  increase  rapidly  in 
relation  to  P (W) . 

It  is  also  noteworthy  that  the  value  of  P(C)  is  not  much 
different  from  the  value  ot  P(W)  at  this  SNR  In  fact,  under  the 
condition  that  a subject  responds  at  this  SNR,  the  expected  value  of 
P(C)  for  the  modified  threshold  procedure  is 


P(W) 

Figure  22.  Comparison  of  Experimental  Results  for  Strict  and  Lax 
Criteria  Discrimination  at  Two  Values  of  Signal-to- 
Noise  Ratio. 
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Pmt(C  | Response)  - PS„(C)  / (P^CO  ^ PSH(W» 

= 0.243/(0.243  t 0.143) 

= 0.63  , 

which  is  comparable  to  values  of  P(C)  observed  for  other  experiments 
using  high  pass  filtering  of  marine  sounds  such  as  Experiment  III. 

The  subscripts  mt  and  SW  above  are  used  to  differentiate 
between  the  modified  procedure  and  slide-wire  experiment. 


CHAPTER  IX 


SUMMARY  AND  CONCLUSIONS 


The  objective  of  the  experiments  reported  in  this  thesis  was  to 
quantify  the  subjective  response  of  listeners  to  noise-like  sounds  as 
are  encountered  frequently.  Such  sounds  play  a significant  role  in  a 
number  of  situations  ranging  from  the  control  of  the  speed  of  an 
automobile  by  the  driver  to  tasking  auditory  discrimination  tasks  in 
industrial  settings.  The  response  cf  listeners  to  differences  between 
sounds,  the  way  in  which  they  arrive  at  the  classification  decisions 
which  are  made  routinely,  and  what  the  significant  parameters  of  the 
sounds  are,  all  are  cf  both  theoretical  and  practical  interest.  The 
importance  of  this  information  increases  as  there  is  a greater  emphasis 
on  protecting  the  worker  from  excessive  noise  through  the  use  of  hearing 
protectors  and  other  means  cf  isolating  the  human  from  such  sounds. 

Also,  the  need  to  automatically  monitor  industrial  processes  places 
added  emphasis  on  knowledge  of  how  workers  are  able  to  interact  with 
such  processes  by  means  of  sounds. 

The  experiments  consisted  of  eliciting  discrimination  decisions 
from  listeners  trained  to  recognize  sound  patterns.  These  patterns 
were  bcth  marine  sounds  and  noise-like  stimuli  generated  in  the 


laboratory . 


Two  experimental  techniques  were  applied  to  populations 


I 


>, 


I' 
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of  listeners  consisting  of  trained  University  student  subjects  and  of 
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N:  y personnel  trained  in  sound  recognition.  The  experimental  results 
are  the  required  signal-to-noise  ratio  to  discriminate  between  two 
sound  patterns  and  the  associated  probabilities  of  correct  or  wrong 
classn ications.  In  addition,  a continuous  rating  scale  was  used  to 
obtain  more  detailed  information  about  the  receiver  operating 
characteristics  (ROC's). 

The  treatise  postulates  a model  of  the  discrimination  task  for  a 
set  of  signal  patterns  which  differ  by  either  having  or  not  containing 
a particular  auditory  feature,  a dichotomous  feature.  In  Chapter  II, 
this  model  is  presented  in  terms  of  a hypothesis  test  about  the 
presence  of  the  tearure.  It  is  suggested  that,  as  a first 
approximation,  the  discrimination  task  can  be  treated  as  a detection 
problem  where  the  signii leant  factor  is  the  detectability  of  the 
dichcromcus  feature-  Known  results  from  signal  detection  theory  are 
used  t obtain  the  optimum  detection  performance  (d ’ ) for  the  case 

opt 

where  the  dichotomous  feature  is  a band  of  noise  at  least  as  wide  as 
the  minimum  critical  auditory  band. 

in  Chapter  111,  this  model  is  expanded  to  treat  the  sequential 
r e r -gn J tier  problem  which  is  representative  oi  many  real-world  auditory 
recognition  tasks.  In  this  situation,  the  listener  will  defer  his 
decision  untii  more  information  becomes  available.  The  concept  of  a 
decision  criterion  is  introduced  and  it  is  seen  that  this  criterion 
plays  a significant  rcle  both  theoretically  and,  in  latet  chapters. 


in  expet iment  s. 
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The  experimental  techniques  used  to  investigate  the  listener 
performance  is  either  a modification  of  classical  threshold  techniques, 
the  "modified  threshold"  procedure,  or  a continuous  rating  method 
whereby  the  listener  indicates  his  degree  of  confidence,  the  "slide- 
wire"  experiment  The  techniques  and  the  various  experiments  performed 
are  described  in  Chapters  IV  and  V. 

One  of  the  principal  areas  of  investigation  was  that  of 
discrimination  when  the  dichotomous  feature  is  a band  of  noise.  This 
is  an  area  where  a considerable  amount  of  experimental  data  relating 
to  the  detection  performance  of  humans  is  available.  Also,  the 
investigation  is  motivated  by  findings  in  speech  recognition.  Chapter 
VI  details  seven  experiments  which  deal  with  this  question.  The 
dichotomous  feature  is  either  a band  of  noise  resulting  when  one  of 
the  twc  signal  patterns  to  be  compared  is  high  pass  filtered  or  is  a 
bandpasi  filtered  noise  when  laboratory  generated  signals  are  used. 

It  is  found  that  the  model  of  the  process  as  one  of  detecting 
the  dichctcmcus  feature  gives  good  agreement  with  experimental  results 
for  some  cf  the  sound  patterns  used.  The  listener’s  decision  to 
classify  the  sound  when  the  dichotomous  feature  is  present  is  in  all 
cases  made  at  a mean  s ignal-to-nolse  ratio  (SNR)  close  to  that 
predicted.  The  def ec tab il  1 ty  is  found  to  be 

8.6  < 10  log  d'  < 11.3  dB 

° opt 

with  the  probability  of  correct  classification  P(C)  averaging  about 
63  percent.  This  detectability  is  computed  using  the  effective 
bandwidth  c!  the  dichotomous  feature  and  an  assumed  integration  time 
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of  500  msec  The  predicted  value  of  d'  is  9.2  dB  at  a P(C)  of 

opt 

0.65.  When  the  feature  is  absent,  the  model  must  be  revised  to  account 
for  the  fact  that  the  listener  must  detect  and  act  on  other  features 
of  the  sound  which  he  uses  as  cues  to  decide  that  the  dichotomous 
feature  is  absent. 

In  Chapter  VII,  the  investigation  of  dichotomous  features 
concentrates  on  temporal  changes  in  the  sound.  The  temporal,  or 
envelope  structure,  is  known  to  be  important  in  speech  perception. 

The  experiments  used  sounds  which  differ  in  their  amplitude  modulation 
structure.  For  this  dichotomous  feature  the  agreement  with  the  model 
of  the  process  as  one  of  detecting  the  feature  is  also  found  to  be 
good  Where  the  required  increment  in  intensity  to  the  average 
(AI/l),  the  Weber  fraction,  is  predicted  to  be  -4  dB,  the 
experimental  results  are  in  the  range 

-4  3 < Al  / 1 <-3.3  dB  . 

in  spite  of  the  good  agreement  between  a simple  model  of  the 
process  and  the  measured  listener  responses,  the  experiment  identified 
some  significant  experimental  biases  and  differences  between 
populations  of  listeners.  Chapter  VIII  deals  with  listener  criterion 
and  response  biases.  The  criterion  which  the  listeners  use  for  making 
a tinal  classification  decision  is  strongly  influenced  by  the 
experimental  setting  and  their  training.  Significant  interdependence 
is  introduced  by  the  modified  threshold  technique  between  the  effect 
of  signal  detectability  and  that  due  to  time  into  the  response  period. 
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The  time  seems  to  strongly  impact  the  listener's  criterion.  The 
magnitude  of  this  effect  was  not  anticipated,  especially  for  the 
population  of  listeners  consisting  of  the  Navy  personnel,  and  some  of 
the  data  is  of  marginal  utility  as  a result.  Unexpectedly,  there  is 
a significant  difference  in  performance  in  the  discrimination  task 
between  the  student  subjects  and  Navy  personnel.  The  latter  group  of 
subjects  consistently  required  a higher  SNR  to  respond  and  were,  in 
certain  respects,  not  as  accurate.  A significant  difference  also 
exists  between  the  Navy  personnel  with  less  than  two  years  experience 
and  those  with  extensive  at-sea  exposure. 

The  author  set  cut  to  devise  an  experimental  technique  which 
could  be  used  for  the  purpose  of  quantifying  listener  performance  in 
a discrimination  task  and  to  apply  the  technique  to  a number  of  cases 
of  practical  interest  in  industrial  situations.  For  this  reason,  the 
data  is  in  some  instances  only  adequate  to  do  first  order  comparisons 
to  the  proposed  model  However,  good  agreement  is  found  between  data 
collected  by  others  in  very  specialized  detection  oriented  situations 
and  the  present  task  which  is  a complex  one  in  comparison. 

A number  of  areas  for  further  investigation  have  been  identified. 
These  include  tests  to  resolve  some  of  the  ambiquities  discussed  in 
Chapter  VIII.  The  relationship  of  continuous  rating  experiments  to 
results  obtained  from  the  modified  threshold  technique  is  one  area 
where  a large  volume  of  data  is  needed  to  decrease  the  uncertainties 
associated  with  the  various  probabilities  associated  with  event- 
response  alternatives  The  effect  of  exposure  order  on  the  results 
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obtained  for  experiments  with  dichotomous  bands  of  noise  is  an  area 
which  also  warrants  detailed  investigation  as  it  relates  to  the  utility 
of  such  an  experimental  technique  in  studying  the  discrimination 
process.  Although  the  modified  threshold  technique  has  been 
demonstrated  to  be  a relatively  quick  way  to  obtain  information  about 
the  terminal  decision  process,  the  strong  interaction  of  time  with 
the  subject's  criterion  is  a bothersome  factor.  A technique  is  needed 
which  will  allow  time  dependent  criterion  shifts  to  be  identified 
separate  from  the  effect  of  increasing  signal  detectability. 
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QUESTIONNAIRE  - FORM  1 

A STUDY  OF  RADIATED  NOISE  ( ) 


Note:  This  material  will  be  treated  as  personal,  please  answer  all 

questions  fully. 

Name: Grade: 

Current  Billet: 

Approximate  number  of  years  of  sonar  experience  

Does  this  experience  include  operations  involving 

surface  ship  contacts?  

List  recent  shore  based  sonar  schools  attended 

Year  Course  Where 


•k  k k * * 


Medical  History:  Do  you  now  have:  Ringing  in  the  ears  , A head 

cold  , perforated  ear-drum  . 

Have  you  had  in  the  past  the  following  symptoms? 


Symptom  (x) 

Often 

When  you  were 
young 

Recently 

Ear  aches  

. . • ( 

) 

< ) 

( 

) 

Discharge  from  the  ear 

. . . ( 

) 

( > 

( 

) 

Dizziness  

. . . ( 

) 

( > 

( 

) 

Trauma  due  to  explosion  

. . . ( 

) 

( ) 

( 

) 

Temporary  loss  of  hearing 
following  exposure  to 
gun-fire  or  other  loud 
noise  

( 

) 

( > 

( 

) 

Questionnaire  Used  with  Navy  Personnel. 


I 


t 


INFORMED  CONSENT  AGREEMENT  - 
PARTICIPATION  IN  A STUDY  OF  RADIATED  NOISE 


You  will  be  participating  in  an  experiment  designed  to  test  the 
ability  of  a listener  to  respond  to  changes  of  sounds.  The  test  will  be 
conducted  in  a quiet  enclosure  using  headphones,  and  the  loudness  with 
which  the  sounds  will  be  presented  will  be  carefully  controlled.  At  no 
time  will  the  sound  be  so  loud  as  to  cause  discomfort  but  if,  at  any  time, 
you  feel  uncomfortable  you  should  remove  the  headphones  and  leave  the  test 
enclosure.  Your  doing  this  will  not  be  used  as  a basis  for  discont inuing 
your  participation  in  the  test.  You  may  also  terminate  your  participation 
in  the  experiment  at  any  time. 

For  your  convenience,  test  sessions  may  be  scheduled  at  most  reasonable 
times.  However,  to  avoid  fatigue  and  to  keep  from  biasing  the  experiment, 
no  more  than  one  60  minute  test  session  may  be  done  in  any  24  hour  period. 

The  sounds  to  which  you  will  be  listening  may  or  may  not  be  familiar 
to  you.  The  method  for  recording  your  response,  the  order  in  which  signals 
are  presented,  and  the  details  of  the  experiment  will  be  explained  to  you 
prior  to  beginning  any  test  session.  If  you  have  any  questions,  please  ask 
for  clarification  by  the  experimentor . A memorandum,  "Prospectus  for  a 
Study  of  Radiated  Noise  as  a Communicative  Signal,"  is  available  for  you  to 
read  if  you  desire  to  know  more  about  the  overall  test  objectives,  its 
application  and  rationale. 


I, 


subject  signature 


have  read  this  document 


to  understanding.  Date 

Witnessed  by:  Date 


Informed  Consent  Agreement  for  University  Student 
Subj  ects . 


Ins  trunnions  appearing  on  the  r itsr  atrdic  'ape  to  whi'b  Navy 

personnel  ,»e.re  exp.  sed.  These  mi*  rue Mens  were  toll  wed  by  a 

question, 'answer  period  during  which  tine  all  j e:t£:rit;e‘  about  the 

experimental  technique  were  tc  be  resolved. 

You  will  be  par  t ipa’ _ng  m ar  experiment  designed  tc  test  the 
ability  ■:  i a listener  to  detect  changes  lr  sound  The  loudness 
at  which  f he  sounds  ate  presented  will  be  .artfully  rtf  rolled 
At  nc  time  will  the  sound  be  sc  loud  as  tc  . au?e  discomfort, 
but  ll  at  any  time  you  teel  uncomfortable  >•  u so:.  .la  remove  the 
headphones  and  make  this  fact  known  tc  the  res'  rdu  tor  Also, 
it  you  are  at  ell  uncertain  as  tc  the  te-t  procedures  you  should 
ark  f-  n r.laritieati'u. 

The.  scur.d;  you  will  hear  .11  oe  marine  r i.aes  wh‘<  h ma>  or  may 
not  be.  familiar  to  you  You  will  be  asked  t c 1-a  n rhes°  sounds 
ana  then  decide  which  is  presented  ir.  'he  r . ist  The  test 
sequence  will  c nsist  .>1  'w-i  signals  pi-evented  »:r.bcut  interfering 
noise  These  -ignals  will  be  dene  red  signal  A and  signal  b 
Signal  A will  "e  presented  ther  ignal  B The  signal  will  then 
be  epeared  A - B Ycu  will  t.h°n  oe  presented  signal  A or  signal 
B in.  a n :se.  This  .s  a -espinse  period  worth  « il  . be  indicated 
by  a green  ) igr.t  n rhe  re sp_n.se  re-  ,'dei  . The  .byf  t ive  is  to 
itid.ta'e  you-  dec  t s ; c-c  if  whether  tffc  unknowc  signal  is  signal 
A or  signal  B Ins  t .-iter ion  you  should  use  n matt m6  your 
deiii.on,  a well  as  'he  way  your  response  r.  oe  :raica»ed, 

will  car  y , You  wn  o).  way  a be  ns"i'rtd  at  he  beginning  .it 
a test  session  about  the  detailed  p coedvre  1 t b°  f llrwed.  For 
this  set  ces  c i experiments,  the  res- a a • e gat  zed  inre  groups 
or  everts  f-  - a!l  the  cert;  cl  a gi  up,  'he  signals  A and  B 
wiM  be  'be  same  Alte  iitst  being  p-e  <=n*ed  m he  order  A 
then  B,  and  '.-peered  In  the  sequent-  A - F,  they  a e :n.ly 
repeated  once  mor*  ' th*s  c der  ’ • «■  eSp,  v u<  ttem,.ry  cl  the 

sound.-  be.  .e  a-  'he.-  reap  n-e  pe  i.d  r-6  .ns  I be  sounds  A and 
B ate  presented  '-od.mly  In  'he  noi-e  <.;•>->  the  lik-librcd  ct 
signal  A being  ps-efed  t *-te  r,  be-.ng  e_,u=>'.  f * he 


I i 

*ei - h -od 

■t  s 

»ir  C . 1 

R be .ng  p- esenr  ea 

ir.  the 

nc 

:.s? 

An 

e xait.p  le 

Ct  1 

• iif 

-e.j_er  e 1 ol  lews 

.n  t h : 

! 3 

ca-,e, 

•he  amount 

ol 

n: . ;»  m. 

x‘  a 

*irh 

the  s.gnal  will  de 

rease  ■ 

aC 

1 OU  Will 

notice  'hat  'he  - end  is  at  tirst  masked  by  'he  noise.  As  the 

amount  ot  n .-e  hang?-,  it  w:  n b»-  oms  ea-ie  ••  hrat  rhe  sound 

Final.-.,  y-u  v be  able  to  deride  wi'n  teas  nobly  ertainty 
wh  r h . ,und,  A r P,  t was  which  was  presented  w *b  the  noise. 

Lister  ft-  thi>-  ^eqi.en  e and  try  ?,  dec  id  • wh:  h s und  is 

pr  e-er.'  ed  ’ n r te  n.  ' -e 


Instr actions  on  the  audio  tape  to  which  Navy  personnel  were 


■ 

exposed  prior  to  beginning  tests  using  the  slide-wire  continuous 

rating  response  recorders. 

For  this  experiment  the  response  recorder  will  be  used  to 
indicate  ycur  degree  of  certainty  about  which  sound  it  is  which 
is  presented  in  the  noise.  Initially,  the  indicator  should  be 
centered  on  zero.  The  amount  ct  noise  mixed  with  the  signal  will 
decrease  slowly.  Move  the  indicator  to  record  your  decision 
about  which  signal  is  the  unknown.  If  you  think  perhaps  that  the 
unknown  signal  is  signal  A,  move  the  indicator  up  the  scale 
towards  A.  The  further  up  the  scale,  the  mere  certain  you  are 
of  ycur  choice-  Try  to  make  a tentative  decision  as  soon  as 
you  can  and  do  not  hesitate  to  change  your  mind  as  more 
information  becomes  available. 

You  should  try  to  associate  a numeric  value  with  your  degree  of 
certainty.  For  example,  an  8 should  be  used  to  indicate  a 
reasonably  certain  decision.  Fcr  calibration,  now  indicate  a 
response  as  if  you  were  reasonably  certain  that  the  unknown 
signal  was  P Now  indicate  a low  confidence  of  a decision  of  A, 
and  a high  comidence,  very  certain,  of  a decision  of  the  unknown 
being  A.  A r *he  conclusion  of  an  event,  always  return  the 
indicator  t.c  the  zero  position  Reset  the  indicator  to  zero  now. 

Please  indicate  your  responses  for  the  following  events. 

Remember,  that  fcr  all  events  in  a group  the  signals  A and  B are 

rhe  same  Also,  try  to  maintain  the  same  numeric  value  for  your  Jj| 

criteria  and  recente  the  indicator  berween  events. 
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Instructions  recorded  at  the  beginning  of  all  auditory  tapes 

used  with  Navy  personnel  These  tests  ail  used  grouped  data. 

The  test  sequence  will  consist  of  two  signals  presented  without 
interfering  noise.  These  signals  will  be  denoted  signal  A and 
signal  B.  Signal  A will  be  presented,  then  signal  B.  The 
signals  will  then  be  repeated  in  the  sequence  A then  B During 
the  response  period,  which  will  be  indicated  by  a green  light 
cn  the  response  recorder,  either  signal  A cr  signal  B will  be 
presented  in  a ncise.  The  amount  of  noise  will  decrease  slowly 
The  objective  is  to  indicate  youi  decision  as  to  which  signal 
you  conclude  is  mixed  with  the  noise  Indicate  your  choice  by 
pressing  the  switch  marked  A if  ycu  decide  that  signal  A was 
mixed  with  the  noise,  or,  press  the  switch  marked  B if  you 
decide  that  signal  B was  mixed  with  the  ncise.  You  should 
indicate  this  decision  as  scon  as  you  can  under  the  condition 
that  you  are  reasonably  certain  of  your  choice.  For  this 
series  of  experiments  the  tests  ate  organized  into  groups  of 
events.  For  all  events  of  a group  the  signals  A and  B will  be 
the  same.  The  signals  A cr  B are  presented  randomly  in  the 
noise  with  each  being  equally  likely  Now,  please  indicate 
your  classification  decision  for  the  ft  1 lowing  cases. 
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Voice  comments  following  grouped  events  and  leading  into  another 

group  or  terminating  the  test  session. 

This  is  the  end  of  one  group  of  tests.  Another  group  of  tests 
follows.  For  this  grcup  the  signals  A and  B will  be  the  same. 
However,  these  will  generally  net  be  the  same  signals  as  in  the 
previous  group.  Please  try  to  learn  these  signals  without 
regard  for  other  signals  you  have  heard  during  this  experiment. 

This  is  the  end  of  another  group  of  tests.  In  the  group  of 
events  which  follows,  the  signals  A and  B are  the  same  throughout. 
These  signals  will  generally  not  be  the  same  as  those  heard 
previously.  Please  try  to  learn  these  sounds  independent  of 
other  signals  you  may  have  been  exposed  tc  in  this  experiment. 

End  of  another  group  of  events,  The  final  group  of  events 
follows.  For  this  group  as  in  those  before,  the  signals  A 
and  B will  be  the  same.  The  likelihood  of  signal  A being 
presented  in  the  noise  is  the  same  as  is  the  likelihood  of 
signal  B being  presented  in  the  noise- 

This  concludes  a test  session.  Thank  you  for  your  cooperation. 
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HARDWARE  SYSTEM 


The  tapes  used  as  the  source  of  the  auditory  stimuli  in  these 
tests  are  the  end  product  of  a carefully  controlled  series  of  taping 
steps.  The  marine  sources  of  interest  come  recorded  on  a large 
variety  of  machines,  at  various  tape  speeds  and  recording  techniques. 
Selected  portions  of  these  raw  source  tapes  are  re-recorded  onto 
1/2  inch  magnetic  tapes  in  FM  recording  mode  and  at  60  or  30  inches 
per  second.  During  the  re-recording  of  these  master  tapes,  adjustment 
is  made  for  hydrophone  or  other  frequency  weighting  appearing  on  the 
raw  data.  The  signals  are  also  prewhitened  by  an  amplifier  with  a 
6 dB  per  octave,  increasing  gain  vs  frequency  behavior. 

The  master  tapes  serve  as  inputs  to  any  stimuli  recordings 
which  use  actual  recorded  marine  sounds.  Alternately,  a weighted  noise 
source  may  be  substituted  for  either  member  of  the  exposure  set. 
Recorded  ocean  ambient  or  shaped  noise  serves  as  the  background  noise 
against  which  the  probe  signal  is  presented.  These  various  signals 
are  properly  time  multiplexed  by  means  ol  an  analog  selector  which 
receives  control  signals  from  a digital,  sequencer.  Figure  23  shows 
the  various  components  of  the  system  needed  to  create  primary  tapes 
from  master  tapes.  The  flow  of  analog  and  digital  signals  is  also 
shown.  In  Figure  23,  those  components  which  are  not  specifically 


SIGNAL  CONDITIONING 


Figure  23.  Flow  Diagram  Showing  the  Principal  Components  Needed 
to  Create  Primary  Tapes. 


— 


150 


indicate  l as  being  commercial  items  were  fabricated  at  the  Applied 
Research  Laboratory.  Most  of  these  components  were  designed  and  built 
by  the  author.  The  major  portions  of  the  system  with  the  exception 
of  the  analog  input  tape  unit  are  pictured  ir.  Figure  24.  Necessary 
interconnecting  cabling  has  been  omitted  from  this  photograph,  however. 

The  digital  sequencer  and  associated  controller  orchestrates 
the  required  functions  in  response  to  a simple  program  stored  in  a 
read  only  memory.  The  sequencer  starts  and  stops  the  output  tape 
drive,  sequences  the  various  audio  signals,  and  causes  the  generation 
of  conticl  signals  which  will  be  used  at.  the  test  site  to  indicate 
response  periods  and  to  record  responses.  The  exact  order  of  the 
process  is  determined  by  a program  selector  and  by  the  state  of  sense 
switches  Ihe  setting  of  the  balanced  mixer  is  performed  manually 
in  response  to  a cue  light  which  indicates  the  response  period  portion 
cf  the  audio  sequence. 

A direct  recoidea  monitor  channel  ot  the  output  tape  is  used 
for  voice  annotation  of  cut  numbers  on  the  primary  tapes  and  to  record 
a 12.4  kHz  pilot  tone  This  tone  is  on  during  valid  portions  of  data 
cn  an  FM  recorded  audio  channel  and  an  FM  recorded  control  channel. 

This  control  channel  has  tones  for  control  of  the  response  recorder 
and  a balanced  mixer  setting  proportional  VCO  output. 

Cuts  from  the  primary  tapes  are  re-recorded  onto  the  final  audio 
tapes  in  a randomized  order.  The  monitor  channel  of  rhe  primary  tape 
is  used  both  to  locate  the  desired  cuts  and  to  allow  for  voice 


Figure  24.  Photograph  of  tiie  Hardware  System  Used  to  Create 

Primary  Tapes  with  Interconnecting  Cabling  Omitted. 
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annotation  between  events  on  the  audio  tapes.  This  voice  annotation 
of  event  numbers  and  of  special  instructions  to  the  listener  is 
usually  from  a cassette  recording,  A phase  locked  loop,  analog 
selector  combination,  insures  that  the  objectionable  FM  discriminator 
noise  output  while  searching  the  primary  tape  does  not  appear 
on  the  tapes  used  for  listener  tests.  Figure  25  diagrams  this  aspect 


of  the  recording  process. 
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